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(57) ABSTRACT 

The present invention relates to proteins associated with T 
cell activation, termed TCAPs (T Cell Activation-associated 
Proteins), TCAP-encoding genes and nucleic acid derived 
therefrom, and methods for identifying TCAP-encoding 
genes. The method provides amino acid sequences of the 
TCAPs TA-GAP, TA-GPCR, TA-PP2C, TA-NFKBH, TA- 
KRP, TA-WDRP, and TA-LRRP, and nucleotide sequences 
of the genes encoding them, and nucleic acid derived 
therefrom, as well as amino acid and nucleic acid derivatives 
(e.g., fragments) thereof. The invention further relates to 
fragments (and derivatives thereof) of particular TCAPs that 
comprise one or more domains of a TCAP. Antibodies to 
TCAPs, and to TCAP derivatives, are additionally provided. 
Methods of production of the TCAPs, derivatives, e.g., by 
recombinant means, are also provided. Therapeutic and 
diagnostic methods and pharmaceutical compositions are 
provided. In specific examples, isolated TA-GAP, 
TA-GPCR, TA-PP2C, TA-NFKBH, TA-KRP, TA-WDRP, 
and TA-LRRP genes from human, and the sequences 
thereof, are provided. 
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| | | | |---- ---| 

1 GACATAGCTGCCCTAAAAGGAATGAGGAAGCGAGAGCTCTCCAGTGTCTGGCTGGCTCCG 60 



61 TCCGTGTGACAGCCCATGATGTTCTTTCCGGTCTCTGTAATATTCTGAATTTCCACCTGC 12 0 

1 | 1 1 1 1 

121 CCGCCCCTTCGCTTATAATGCAGAGCATGTGAAGGGAGACCGGCTCGGTCTCTCTCTCTC 18 0 

1 1 1 1 1 1 

181 C C AGTGG AC TAG AAGGAG C AG AGAGTT ATG CTGTTTCTCCCATT CT TT A C AG C T C A C CGG 24 0 

| | | | | | 

241 ATGTAAAAGAA CT C TGG C TAGAGA C C C T C CAAGGA CAGAGG CACAG CCAC AC GGGAGTGA 3 00 

1 1 1 1 1 1 

301 AATCCACCCCTGGACAGTCAGCCGCAATACTGatgaagctgagaagcagccacaatgctt 3 60 
1 MKLRSSHNAS10 



3 61 caaaaacactaaacgccaataatatggagacactaatcgaatgtcaatcagagggtgata 42 0 
11 KTLNANMMETLI ECQSEGD I 30 

| | | | | | 

421 tcaaggaacatcccctgttggcatcatgtgagagtgaagacagtatttgccagctcattg 48 0 
31 KEHPLLASCESEDS ICQLIE 50 

| | | | | | 

4 81 aagttaagaagagaaagaaggtgctgtcctggccctttctcatgagaaggctctcccctg 54 0 
51 VKKRKKVLSWPFLMRRLSPA70 

| | | | | | 

541 catcagatt tt tctggggct ttggagacagact tgaaagcatcgctat ttgatcagccct 600 
71 SDFSGALETDLiKAS LFDQPIj 90 

| | | | | | 

601 tgtcaattatctgcggtgacagtgacacactccccagacccatccaggacattctcacta 66 0 
91 SIICGDSDTLPRPIQDILTI 110 

| I I i I I 

6 61 ttctatgccttaaaggcccttcaacggaagggatattcaggagagcagccaacgagaaag 72 0 
111 LCLKGPSTEGI FRRAANEKA 130 



FIG. 1A 
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I | | | | | 

721 cccgtaaggagctgaaggaggagctcaactctggggatgcggtggatctggagaggctcc 78 0 
131 RKELKEEIiNSGDAVDLERIiP 150 

I I I I I I 

781 ccgtgcacctcctcgctgtggtctttaaggacttcctcagaagtatcccccggaagctac 84 0 
151 VHLLAVVFKDFLRS I PRKLL 170 

| | | | | | 

841 tttcaagcgacctctttgaggagtggatgggtgctctggagatgcaggacgaggaggaca 9 00 
171 SSDLFEEWMGALEMQDEEDR 190 

| | , | | | 

901 gaatcgaggccctgaaacaggttgcagataagct cccccggcccaacctcctgctac tea 94 0 
191 IEALKQVADKLPRPNLLLLK 210 

I I I I I I 

961 agcacttggtctatgtgctgcacctcatcagcaagaactctgaggtgaacaggatggact 102 0 
211 HLVYVLHLISKNSEVNRMDS 230 

| | | | | | 

10 21 ccagcaatctggccatctgcattggacccaacatgctcaccctggagaatgaccagagcc 10 8 0 
231 SNLAICIGPNMLTLENDQSL 250 

| | | | | | 

10 81 tgtcatttgaagcccagaaggacctgaacaacaaggtgaagacactggtggaattcctca 114 0 
251 SFEAQKDLNNKVKTLVEFLI 270 

| | | | | | 

1141 ttgataactgctttgaaatatttggggagaacattccagtgcattccagtatcacttctg 12 0 0 
271 DNCFEI FGENI PVHSS ITSD 290 

| | | | , | 

12 01 atgactccctggagcacactgacagttcagatgtgtcgaccctgcagaatgactcagcct 12 6 0 
291 DSIjEHTDSSDVSTLQNDSAY 310 

| | , , | | 

1261 acgacagcaacgaccctgatgtggaatccaacagcagcagtggcatcagctctcccagca 13 0 0 
311 DSNDPDVESNSSSGISSPSR 330 

| | | | , | 

1321 ggcagccccaggtgcccatggccacagctgctggcttggatagcgcgggcccacaggatg 13 6 0 
331 QPQVPMATAAGLDSAGPQDA 



1381 cccgagaggtcagcccagagcccattgtgagcaccgtggccaggctgaaaagctccctcg 14 00 
351 REVSPEPIVSTVARLKSSLA 370 



FIG. IB 
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I | I I I I 

1441 cacagcccgataggagatactcagagcccagcatgccatcctcccaggagtgcctcgaga 15 00 
371 QPDRRYSEPSMPSSQECLiES 390 

1 1 1 1 1 1 

15 01 gccgggtgacaaaccaaacactaacaaagagtgaaggggacttccccgtgccccgggtag 15 60 
391 RVTNQTLTKSEGDFPVPRVG 410 

| | | | | | 

1561 gctctcgtttggaaagtgaggaggctgaagacccatttccagaggaggtct tccctgcag 162 0 
411 SRLESEEAEDPFPEEVFPAV 430 



1621 tgcaaggcaaaaccaagaggccggtggacctgaagatcaagaacttggccccgggttcgg 168 0 
431 QGKTKRPVDLKIKNLAPGSV 450 

| | , | | | 

16 81 tgctcccgcgggcactggttctcaaagccttctccagcagctcgctggacgcgtcctctg 174 0 
451 LPRALVLKAFSSSSLDASSD 470 

1 1 1 1 1 1 

1741 acagctcgcccgtggcttctccttccagtcccaaaagaaatttcttcagcagacatcagt 18 0 0 
471 SSPVASPSSPKRNFFSRHQS 490 

| | | | | | 

18 01 ctttcaccacaaagacagagaaaggcaagcccagccgagaaattaaaaagcactccatgt 18 6 0 
491 FTTKTEKGKPSRE I KKHSMS 510 

| | | | | , 

18 61 ctttcacctttgcccctcacaaaaaagtgctgaccaaaaacctcagcgcgggctctggga 192 0 
511 FTFAPHKKVLTKNLSAGSGK 530 

| | | | | | 

1921 aatcgcaagactttaccagggaccacgtcccgaggggtgtcagaaaggaaagccagcttg 198 0 
531 SQD FTRD HVP R-GVR KE S Q L A 550 

| | | | ] | 

1981 ccggccgaatcgtgcaggaaaatgggtgtgaaacccacaaccaaacagcccgcggcttct 204 0 
551 GRIVQENGCETHNQTARGFC 570 

I I | | | | 

2041 gcctgagaccccacgccctctcggtggatgatgtgttccagggagctgactgggagaggc 2100 
571 LRPHALSVDDVFQGADWERP 590 

| | | | | | 

2101 ctggaagcccaccctcttatgaagaggccatgcagggcccggcagccagactagtggcct 216 0 
591 GSPPSYEEAMQGPAARLVAS 610 



FIG. 1C 
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2161 ccgagagccagaccgtggggagcatgacggtggggagcatgagggcgaggatgctggagg 22 2 0 
611 ESQTVGSMTVGSMRARMLEA 630 

1 1 , 1 1 1 

2 221 cgcactgcctcctaccccctcttccacctgctcaccacgtagaggactcaagacacaggg 228 0 
631 HCLiLPPL P PAHHVEDSRHRG 650 



22 81 gcagcaaagagccactccctggccacggactctctcccctgcctgagcgatggaaacaga 2 34 0 
651 SKEPLPGHGLS PLPERWKQS 670 



2341 gcagaactgtccatgcttctggggactctctggggcacgtgtctggcccagggagacctg 24 0 0 
671 RTVHASGDSLGHVSGPGRPE 690 



24 01 agctcctcccgctgaggaccgtctccgagtccgtgcagaggaataagcgggactgtctcg 24 6 0 

691 LLPLRTVSESVQRNKRDCLV 710 

| | | | | | 

24 61 tgcgacgatgtagccagccggtctttgaggctgaccaattccaatatgccaaagaatcgt 2 52 0 
711 RRCSQPVFEADQFQYAKESY 730 

| , | | | | 

2521 atatttagGAGGGAGGCCATACGCCATGCCATAGCTTGTGCTATCTGTAAATATGAGACT 2 580 

731 I * 731 

1 1 1 1 1 1 

25 81 TG T AAAGAA C TGC C T GTAGAT TGTTTTT AAAAGGT C T TGAATAAG CTCCTTG AG AAAG T T 2 64 0 



2 641 GTGGAAAGCC CT C CT CAGTGAGGATAGCTACACCATGGCCATGGCGCATCAGATAGT CT C 2 7 00 

1 1 1 1 , 1 

2 7 01 TGTGTACCTGGATTTGTGCAATATGTAAAAATGTATCAAATGTATTATAGATAAGGTGTT 2760 

| | | | | | 

2761 AGGTGCAAAGGATGTCTAATAATCCCTGCACACGTTTTGAACTTGC AGTGAAGTACACTG 2 82 0 

| | | | | | 

2 821 CTGTTCCTTGCTTCCTGGGGCACTTTTCTCTTGGTTAGTGTTTAAAAATTATCTTCGCTT 2 8 8 0 



FIG. ID 
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2881 TTTTAATGTGGCCTCAAATGTCATGCCAATTTTCACATCTTCCACAAACTCCATTTAGGG 2 94 0 

| | | | , | 

2 941 AGAAATGTTTAAATCTCTGGTATAAGTTTACTCCATACCAGAGTAAACTATATATTACTC 3 000 



3 001 TATATAAGCAGT C TTGCAATAAC T AATC AC C AC C ATAGAAGAAAGAAAC AGAC TG C AAGG 3 060 

| , | | | | 

3 0 61 AACAGAGTTGAGTGTCTGGAGTCATCAAAGGCATTAAAAA CT C CAGTAAAAGC TGGGGC C 312 0 

| | | | | | 

3121 GT AGC AAAAAT C ATGAAAAAC ACTTCAACGTGTC CTTTCAATC ATC CAATTAAATGTGGG 318 0 



I I I 

3181 TAGATTAATGAAAATGTATTACATCAATATTAACTCAT 3 218 



FIG. IE 
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I | | | | | 

1 GACATAGCTGCCCTAAAAGGAATGAGGAAGCGAGAGCTCTCCAGTGTCTGGCTGGCTCCG 6 0 

| | | | | | 

6 1 TCCGTGTGACAGCCCATGATGTTCTTTCCGGTCTCTGTAATATTCTGAATTTCCACCTGC 12 0 

| , | | | | 

121 CCGCCCCTTCGCTTATAATGCAGAGCATGTGAAGGGAGACCGGCTCGGTCTCTCTCTCTC 18 0 



| | ( | | | 

181 CCAGTGGACTAGAAGGAGCAGAGAGTTATGCTGTTTCTCCCATTCTTTACAGCTCACCGG 2 4 0 



I I I I I I 

241 ATGTAAAAGAACTCTGGCTAGAGACCCTCCAAGGACAGAGGCACAGCCACACGGGAGTGA 3 00 

| | | | | | 

3 01 7VATCCACCCCTGGACAGTCAGCCGCAATACTGATGAAGCTGAGAAGCAGCCACAATGCTT 3 60 



3 61 CAAAAACACTAAACGCCAATAATATGGAGACACTAATCGAATGTCAATCAGAGGGTGATA 4 20 



421 T C AAGGAAG AT C C C CTGTTGGCAT CATGTGAGAGTGAAGACAGTATTTG C CAGCT CATTG 4 80 



| | | | | | 

4 81 GACATTCTCACTATTCTATGCCTTAAAGGCCCTTCAACGGAAGGGATATTCAGGAGAGCA 54 0 

| | | | | | 

541 GCCAACGAGAAAGCCCGTAAGGAGCTGAAGGAGGAGCTCAACTCTGGGGATGCGGTGGAT 6 00 

| | | | | | 

601 CTGGAGAGGCTCCCCGTGCACCTCCTCGCTGTGGTCTTTAAGGACTTCCTCAGAAGTATC 660 



661 CCCCGGAAGCTACTTTCAAGCGACCTCTTTGAGGAGTGGatgggtgctctggagatgcag 720 
1 MGADEMQ7 



721 gacgaggaggacagaatcgaggccctgaaacaggttgcagataagctcccccggcccaac 780 
8DEEDR1 EALKQVADKLPRPN 27 



781 ctcctgctactcaagcacttggt ctatgtgctgcacct cat cagcaagaactctgaggtg 84 0 
28IiLLliKHL,VYVI J HL J ISKNSEV 47 



FIG. 2A 
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1 | | | | | 

841 aacaggatggactccagcaatctggccatctgcattggacccaacatgctcaccctggag 90 0 
48NRMDSSNLAICIGPNMLTLE 67 



901 aatgaccagagcctgtcatttgaagcccagaaggacctgaacaacaaggtgaagacactg 96 0 
68NDQSLSFEAQKDLNNKVKTL 87 



961 gtggaattcctcattgataactgctttgaaatatttggggagaacattccagtgcattcc 10 2 0 
88VEFLIDNCFEIFGENIPVHS 107 



1021 agtatcacttctgatgactccctggagcacactgacagttcagatgtgtcgaccctgcag 10 80 
108 SITSDDSLEHTDSSDVSTLQ 127 



10 81 aatgactcagcctacgacagcaacgaccctgatgtggaatccaacagcagcagtggcatc 114 0 
128 NDSAYDSNDPDVESNSSSGI 147 

| | | | | | 

1141 agctctcccagcaggcagccccaggtgcccatggccacagctgctggcttggatagcgcg 12 0 0 
148 SSPSRQPQVPMATAAGLDSA 167 

| | | , | | 

12 01 ggcccacaggatgcccgagaggtcagcccagagcccattgtgagcaccgtggccaggctg 12 60 
168 GPQDAREVS PEPIVSTVARL 187 

I | | ] | I 

12 61 aaaagctccctcgcacagcccgataggagatactcagagcccagcatgccatcctcccag 13 2 0 
188 KSSLAQPDRRYSEPSMPSSQ 207 



1321 gagtgcctcgagagccgggtgacaaaccaaacactaacaaagagtgaaggggacttcccc 13 80 
208 ECL.ESRVTMQTLTKSEGDFP 227 

1 1 1 1 1 | 

13 81 gtgccccgggtaggctctcgtt tggaaagtgaggaggctgaagacccattt ccagaggag 1440 
228 VPRVGSRLESEEAEDPFPEE 247 

| | | | | | 

144 1 gtcttccctgcagtgcaaggcaaaaccaagaggccggtggacctgaagatcaagaacttg 1500 
248 VFPAVQGKTKRPVDLiKI KNL 267 

| | | | | | 

15 01 gccccgggttcggtgctcccgcgggcactggttctcaaagccttctccagcagctcgctg 15 6 0 
268 APGSVLPRALVLKAFSSSSL. 287 

| , , | | | 

15 61 gacgcgtcctctgacagctcgcccgtggcttctccttccagtcccaaaagaaatttcttc 16 2 0 
288 DASSDSSPVASPSSPKRNFF 307 

| , | | | | 

1621 agcagacatcagtctttcaccacaaagacagagaaaggcaagcccagccgagaaattaaa 1680 
308 SRHQSFTTKTEKGKPSREI K 327 



FIG. 2B 
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I | | | | | 

1681 aagcactccatgtctttcacctttgcccctcacaaaaaagtgctgaccaaaaacctcagc 174 0 
328 KHSMSFTFAPHKKVLTKNLS 347 

I I I I I I 

17 41 gcgggctctgggaaatcgcaagactttaccagggaccacgtcccgaggggtgtcagaaag 18 00 
348 AGSGKS QDFTRDHVPRGVRK 367 

| | | | | , 

1801 gaaagccagc ttgccggccgaatcgtgcaggaaaatgggtgtgaaacccacaaccaaaca 18 60 
368 ESQLAGRIVQENGCETHNQT 387 

| | | | | | 

18 61 gcccgcggcttctgcctgagaccccacgccctctcggtggatgatgtgttccagggagct 192 0 
388 ARGFCLRPHALSVDDVFQGA 407 



192 1 gactgggagaggcctggaagcccaccctcttatgaagaggccatgcagggcccggcagcc 198 0 
408 DWERPGSPPSYEEAMQGPAA 427 



1981 agactagtggcctccgagagccagaccgtggggagcatgacggtggggagcatgagggcg 2 04 0 
428 RLVASESQTVGSMTVGSMRA 447 

| | | | | | 

2 041 aggatgctggaggcgcactgcctcctaccccctcttccacctgctcaccacgtagaggac 2100 
448 RMLEAHCLLPPLPPAHHVED 467 



2101 tcaagacacaggggcagcaaagagccactccctggccacggactctctcccctgcctgag 2160 
468 SRHRGS KEPLPGHGLSPLPE 487 

| | | | | | 

2161 cgatggaaacagagcagaactgtccatgctt ctggggactctctggggcacgtgtctggc 222 0 
488 RWKQSRTVHASGDSLGHVSG 507 



2221 ccagggagacctgagctcctcccgctgaggaccgtctccgagtccgtgcagaggaataag 22 80 
508 PGRPELLPLRTVSESVQRNK 527 



22 81 cgggactgtctcgtgcgacgatgtagccagccggtctttgaggctgaccaattccaatat 2 34 0 
528 RDCLiVRRCSQPVFEADQFQY 547 



2 341 gccaaagaa t cgtata tt t aGGAGGGAGGCCATACGCCATGCCATAGCTTGTGCTATCTG 2 4 0 0 
548 A K E S Y I 553 



2 4 01 TAAATATGAGACTTGTAAAGAACTGCCTGTAGATTGTTTTTAAAAGGTCTTGAATAAGCT 2 4 6 0 

j | | | , | 

2 4 61 CCTTGAGAAAGTTGTGGAAAGCCCTCCTCAGTGAGGATAGCTACACCATGGCCATGGCGC 2 52 0 



FIG. 2C 
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2 5 21 ATCAGATAGTCTCTGTGTACCTGGATTTGTGCAATATGTAAAAATGTATCAAATGTATTA 25 80 



2 5 81 T AG AT AAGGTGTT AG GT G C AAAGG ATG T C T AATAATC C CTG C ACACG TTT TG AAC T TGCA 264 0 



2 641 GTGAAGTACACTGCTGTTCCTTGCTTCCTGGGGCACTTTTCTCTTGGTTAGTGTTTAAAA 2 7 00 



2 7 01 ATTATCTTCGCTTTTTTAATGTGGCCTCAAATGTCATGCCAATTTTCACATCTTCCACAA 2 7 6 0 



2 7 61 AC TCCATTTAGGGAGAAATGTTTAAATC TCTGGTATAAGTTTAC TC CATAC CAGAGTAAA 2 82 0 

| | | j | | 

2 821 C T AT ATATTACT CT ATAT AAG C AG TC T TG C AAT AAC TAAT C A C C AC C ATAGAAGAAAG AA 2 88 0 



2 8 81 ACAGACTGCAAGGAACAGAGTTGAGTGTCTGGAGTCATCAAAGGCATTAAAAACTCCAGT 2 94 0 



2 941 AAAAGCTGGGGCCGTAGCAAAAATCATGAAAAACACTTCAACGTGTCCTTTCAATCATCC 3 00 0 



3 001 AATTAAATGTGGGTAGATTAATGAAAATGTATTACATCAATATTAACTCAT 3 051 



FIG. 2D 
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1 AGAGGCAGGCGGCTTGTGAGACGGGCTCCAGAGAAAGGACCTCCCTGGGTCTCTCATTTC 60 



| | | | | | 

6 1 CTGGCTGAAGTTTCTCTTCTCGCTGCTGTGGCAGCATCCAACCCAC AC ACACAGGACCCG 12 0 



121 CATCCTGGGTGATGAAGTCAGACACGCAGCAGCTGGGTGAGTGCTAACGCTCAGATAAGC 18 0 



181 ATCTGTGCCATTGTGGGGACTCCCTGGGCTGCTCTGCACCCGGACACTTGCTCTGTCCCC 24 0 



241 GCCatgtacaacgggtcgtgctgccgcatcgagggggacaccatctcccaggtgatgccg 3 00 
1 MYNGSCCRI EGDTI SQVMP 19 



301 ccgctgctcattgtggcctttgtgctgggcgcactaggcaatggggtcgccctgtgtggt 3 60 
20 PL.L I VAFVLGALGNGVALCG 39 

| | | | | | 

3 61 ttctgcttccacatgaagacctggaagcccagcactgtttaccttttcaatttggccgtg 42 0 
40FCFHMKTWKPSTVYLFNLAV 59 

| | | | | | 

421 gctgatttcctccttatgatctgcctgccttttcggacagactattacctcagacgtaga 48 0 
60ADFLLMICLPFRTDYYLRRR 7 9 

| | | | , , 

481 cactgggcttttggggacattccctgccgagtggggctcttcacgttggccatgaacagg 54 0 
80 HWAFGDI PCRVGLFTLAMNR 99 

| ) | | | | 

541 gccgggagcatcgtgttccttacggtggtggctgcggacaggtatttcaaagtggtccac 600 
100 AGS I VFLTVVAADRYFKVVH 119 

| | | | | | 

601 ccccaccacgcggtgaacactatctccacccgggtggcggctggcatcgtctgcaccctg 660 
120 PHHAVNT I S TRVAAG I VCTL 139 

| | | | | | 

661 tgggccctggtcatcctgggaacagtgtatcttttgctggagaaccatctctgcgtgcaa 72 0 
140 WALVI LGTVYLLLENHLCVQ 159 

| | | | | | 

721 gagacggccgtctcctgtgagagcttcatcatggagtcggccaatggctggcatgacatc 78 0 
160 ETAVSCESF IMESANGWHD I 179 

| | | | j | 

781 atgttccagctggagttctttatgcccctcggcatcatcttattttgctccttcaagatt 84 0 
180 MFQLEFFMPLGIILFCSFKI 199 



FIG. 3A 
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I 1 | | | 1 

841 gtttggagcctgaggcggaggcagcagctggccagacaggctcggatgaagaaggcgacc 90 0 
200 VWSLRRRQQLARQARMKKAT 219 

| | | | | j 

901 cggttcatcatggtggtggcaattgtgttcatcacatgctacctgcccagcgtgtctgct 960 
220 RFIMVVAIVFITCYLPSVSA 239 

| | | ] | | 

961 agactctatttcctctggacggtgccctcgagtgcctgcgatccctctgtccatggggcc 102 0 
240 RLYFLWTVPSSACDPSVHGA 259 

| | | | | | 

1021 ctgcacataaccctcagcttcacctacatgaacagcatgctggatcccctggtgtattat 108 0 
260 LiH I T L S FTYMNSMLDPLVYY 279 



1081 ttttcaagcccctcctttcccaaattctacaacaagctcaaaatctgcagtctgaaaccc 114 0 
280 FSSPSFPKFYNKLKI CSLKP 299 

| | | | | | 

1141 aagcagccaggacactcaaaaacacaaaggccggaagagatgccaatttcgaacctcggt 120 0 
300 KQPGHSKTQRPEEMPISNLG 319 

| | | | | | 

12 01 cgcaggagttgcatcagtgtggcaaatagtttccaaagccagt ctgatgggcaatgggat 12 6 0 
320 RRSC I SVANS FQSQSDGQWD 339 

| | | | | | 

12 61 ccccacattgttgagtggcact gAAC AAGCAGACCAACAACACTGAGGAAGAT AGAGTGG 1320 
340 P H I V E W H 346 

| | | | | | 

13 21 TGACTTAGAATTAACTCGTGCTAAGGGGTCGGGGGCTTTGAAAATGCCACCCCCCTTTCT 13 8 0 

| - • - | | | | | 

13 81 TATTGCAAGACGGCTTCTCGCACATGAACTGCATCCTTCTCATTCTGTCGGAAATGAAAT 1440 

| | | , , | 

1441 TCACACAACTATACCTTTTGGGGAGGTTCCAGTTGATTGAAGTGAGTTGGCTGCATTTTC 15 0 0 



15 01 TTATCTGATCACAATGGCAGGGGACAGAATGTGCATGGAGTGGAGCATGTGTGTGTTGGG 15 6 0 

| | | | | | 

1561 AGGGGGGCTAGGAACTGC ACAGCCCTTGTGTAATTTTCGTTGTTTGTTTTTGTTTTGAGA 162 0 

1 1 1 1 1 1 

1621 CAGAGTCTCACTCTGTGTCCCAGGCTGG AGTGCAGTGGCACAGTCTCGGCTCACTGCAAC 168 0 



FIG. 3B 
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I 1 | | | | 

16 81 CTCTGCCTCCCGGGTTCAAGCAATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGATTAGA 174 0 

1 1 1 _.| 1 1 

1741 GGCGCCAGCCAACACACCCGGCTAATTTTTGTATTTTTAGTAGAGACAGGGTTTTGCCAT 180 0 

| | | | | | 

18 01 GTTGGCCAGGCTGGTCTCGAGCTCCTGACCTCAGGTGATCCGCCTGCCTTGGCCTCCCAA 186 0 

| | , | | | 

18 61 AGTGGTGGGATCACAGGCGTGAGCCACCGTGCCCGGCCTCCCCTGTGTCATTTTAAATGG 192 0 

| | | | | | 

1921 CTAAGTAAATGGGTATATGTGTTTGAATGGGGCATGTTCACTCTCTTAGGGGCTATGGGG 198 0 

| | | | | | 

19 81 C AGTTAGCAGCATTTCCTATCCTCTGACCTTAAATCATTCCTTATCTCAGAAAACAGAAA 2 04 0 

| | | | | | 

2 0 41 CCGGGCTCAGTCAATCA ATGCTTTATTTCAGGCCGAATGAGGCTCTTTAGATTGGGATCT 210 0 

| | | | | | 

2101 ATTGATCTATCAATTTTCATCTTTACATTTCTTTGTACATCTGTACATTTTGTCCAAATG 2160 

| | | | | | 

2161 TACATCTGTACGTCTGTCATCATTGTGACTTCCTGGTAGCCCAAGAAGAACAACAACAAA 22 2 0 



2 221 ACAATCTGCTCTGACCTTCTTCAAATCTTTGTATTTCAAAGAAGGTGCTGAGGGATCTGT 22 8 0 

| | | | | | 

2281 TTCCTTGCCCTGGCTTCTCCAGTGGGATGTGCTGAGTCCAATACAATTGCTTTTATAATT 234 0 

| | | | | | 

2 3 41 GCTTTTGACAACTTGTCATGTGACTGTGAATTGAAATTATTCACTTATTTTCCAAGTATT 2 4 0 0 

1 1 1 1 1 1 

24 01 TACTGAATTCGTATTTGGTGGCAGGCAGTATACTGTGTAATTTTTAGTGGAGGGTCATTA 24 60 

| | | | | | 

24 61 GTCAACTCTTATGTGACAGTAAAGTTTTTTGGGGGGGTGGGGACAGAGAAGTTAAGAGCT 252 0 



FIG. 3C 



Patent Application Publication Jun. 26, 2003 Sheet 13 of 55 US 2003/0119024 Al 



1 1 1 1 | | 

2 521 TTC ATCCTTTCACGGAATACAGTTTCTAGACCGATTCTGTGTGAACATCAGTTTTGTCCT 2 5 8 0 



| | | | | | 

25 81 CTTATTGCAAGACTCCCTCATACACATGAGTTTCCCAAATGTGTACCTGGACCCCTCGAA 2 64 0 



2 641 AC AG AGG ACT C T ACGAAATGAC AGG CTGGCCCTGCC CT G AATT AGGGGG AAAC AT TC C AG 2 7 00 

| | , | | | 

2 7 01 GCCAACTCTAGCTCCTTTCTCAAGCTACAAAGTGGTGAACATGGTTCTCAACTCCTTAAT 2 76 0 

| | | | | | 

2 761 TTATA CTCTCTC AAATGC CCAGG ATAC T CTACC CACTTAAGAAC CTTGC CAACTTC TGGG 2 82 0 

| | j | | | 

2 8 21 GGTTGGGCATGGTGGCTCGCGCTTGTGATCCCAGCACTTTGGGAGACTGAGGCGGATCAC 2 8 8 0 

| | | | | | 

2 8 81 CTGAGGTCAGGAGTTCTAGACCAGCCTGACCGACATGGAGAAACCTCGTCTGTACTGAAA 2 94 0 



2 941 ATTCAAAATTAGCCTGGTGTGGTGGCGCATGCCTATAGTCTCAGCCTCCAGAGTAGCTGG 3 0 0 0 

| | | | | , 

3 0 01 GACTGCGGGCGCCCCACCACCACGCCCGGCTAATTTTTTGTATTTTTAGTACAGACGGGG 3 0 6 0 

""-I " - I " ------- | | | | 

3 061 TTTCATTGTGTTAGCCGGGATGGTCTTGATCTCCTGACTTGTGATCCGCCTGCCTCGGCC 312 0 

| ,„„ | , , | 

3121 TCCCAAAGTGCTTGGATTACAGGTGTAAGCCACCGCACCCCGCCCAGCCTGGCAGATTTT 318 0 

| | | | | | 

3181 ATTTAATCATTTGTAGCTTCATTTTCCTCGTCTGTCAAACAGGGATACTGTAATACAACC 3 2 4 0 

1 1 1 1 1 1 

324 1 TCAGTGTGTCATTGGGCAGTTTAAATGAATGTACATTCCTGAGGCATCAGAACTTTGTTC 3 3 00 

| | | | | | 

3 3 01 ACTGTTATATACCCAATGCCTAGAAGAGGACCTGCACATAGCAGGTGCTCAGTAAATGTT 33 6 0 



FIG. 3D 
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I 1 | 1 | | 

33 61 TGTTGAATGAATGATTAAGTGCATGTAAAGCATTAAGCATAGCGCCTGGCAGTAAGTGCT 34 2 0 

| | | | | | 

3 421 C AAT AT T A TGAC T T C TT AT AT T AAC AC GT T TT A C AT AT AAAG AAAT GG AGG C AAG AAAGC 34 8 0 

| | | | | | 

34 81 ATTTCCTTTGGGGTTTAGAGCGCTTAAGTTGTTCCTCTGTTATCATGCCTGAATTCCCCC 3 54 0 



3 541 GC CC CT C AGTT AC CTGGGGAAGAGTAAAGGC AAGAATT CTT ACC AGC AT TAGTC ATAC AT 3 6 0 0 



3 601 CCTCCTGATAGG 3 612 



FIG. 3E 
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1 AGAGGCAGGCGGCTTGTGAGACGGGCTCCAGAGAAAGGACCTCCCTGGGTCTCTCATTTC 60 



| | | | | | 

6 1 CTGGCTGAAGTTTCTCTTCTCGCTGCTGTGGCAGCATCCAACCCACACACACAGGACCCG 120 



| | | | ( | 

121 CATCCTGGGTGATGAAGTCAGACACGCAGCAGCTGGGTGAGTGCTAACGCTCAGATAAGC 180 



| | | | | | 

181 ATCTGTGCCATTGTGGGGACTCCCTGGGCTGCTCTGCACCCGGACACTTGCTCTGTCCCC 24 0 

| | | | | | 

24 1 GCCatgtacaacgggtcgtgctgccgcatcgagggggacaccatctcccaggtgatgccg 3 0 0 
1 MYNGSCCRIEGDTI SQVMP 19 

1 , 1 1 1 1 

3 01 ccgctgctcattgtggcctttgtgctgggcgcactaggcaatggggtcgccctgtgtggt 36 0 
20PLLIVAFVLGALGNGVALCG 39 

| | | | | 1 

3 61 ttctgcttccacatgaagacctggaagcccagcactgtttaccttttcaatttggccgtg 42 0 
40 FCFHMKTWKPSTVYL FNLAV 59 

| | | | | | 

421 gctgatttcctccttatgatctgcctgccttttcggacagactattacctcagacgtaga 48 0 
60 ADFLLMI CliPFRTDYYLRRR 79 

| | | | | | 

481 cactgggctt ttggggacattccctgccgagtggggctcttcacgttggccatgaacagg 54 0 
80 HWAFGDI PCRVGLFTLAMNR 99 



541 gccgggagcatcgtgttccttacggtggtggctgcggacaggtatttcaaagtggtccac 60 0 

100 AGS I VFL.TVVAADRY FKVVH 119 

1 | 1 | 1 1 

601 ccccaccacgcggtgaacactatctccacccgggtggcggctggcat cgtctgcaccctg 6 60 

120 PHHAVNTI STRVAAGIVCTL 139 



661 tgggccctggt ca t cc t gggaacagt gt a tct tttgctggagaac cat etc tgcgtgcaa 72 0 
140 WALVI LGTVYLLL ENHLCVQ 159 



72 1 gagacggccgtctcctgtgagagcttcatcatggagtcggccaatggctggcatgacatc 7 8 0 

160 ETAVSCES F IMESANGWHDI 179 

| | | | | | 

781 atgttccagctggagttctttatgcccctcggcatcatcttattttgctccttcaagatt 84 0 

180 MFQLEFFMPLGI ILFCSFKI 199 



FIG. 4A 
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I 1 1 1 1 1 

8 41 gtttggagcctgaggcggaggcagcagctggccagacaggctcggatgaagaaggcgacc 9 00 
200 VWSLRRRQQLARQARMKKAT 219 

1 1 1 1 1 1 

901 cggttcatcatggtggtggcaattgtgttcatcacatgctacctgcccagcgtgtctgct 960 
220 RFIMVVAIVFITCYLPSVSA 239 



961 agactctatttcctctggacggtgccctcgagtgcctgcgatccctctgtccatggggcc 102 0 

240 RLYFLWTVPS SACDPSVHGA 259 

-—I i I I I I 

1021 ctgcacataaccctcagcttcacctacatgaacagcatgctggatcccctggtgtattat 108 0 

260 LHITLSFTYMNSMIiDPltVYY 279 

| | | | | | 

1081 ttttcaagcccct:cctttcccaaattctacaacaagctcaaaatctgcagtctgaaaccc 114 0 

280 FSSPSFPKFYNKLKICSLKP 299 

| | | | | | 

1141 aagcagccaggacactcaaaaacacaaaggccggaagagatgccaattt cgaacctcggt 12 0 0 

300 KQPGHSKTQRPEEMPI SNLG 319 

""-I I I I I I 

1201 cgcaggagttgcatcagtgtggcaaatagtttccaaagccagtctgatgggcaatgggat 1260 

320 RRSC I SVANS FQSQSDGQWD 339 

| | , , | | 

1261 ccccacattgttgagtggcact G AAC AAGC AG AC CAAC AAC ACT GAGGAAGAT AGAGTGG 1320 

340 P H I V E W H 346 

| | | | | | 

13 21 TGACTTAGAATTAACTCGTGCTAAGGGGTCGGGGGCTTTGAAAATGCCACCCCCCTTTCT 13 8 0 



1381 TATTGCAAGACGGCTTCTCGCACATGAACTGCATCCTTCTCATTCTGTCGGAAATGAAAT 144 0 

I "I I I I | 

1441 TCAC ACAACTATACCTTTTGGGGAGGTTCCAGTTGATTGAAGTGAGTTGGCTGCATTTTC 150 0 

| | | | | | 

1501 TTATCTGATCACAATGGCAGGGGACAGAATGTGCATGGAGTGGAGCATGTGTGTGTTGGG 156 0 

| | | | | | 

1561 AGGGGGGCTAGGAACTGCACAGCCCTTGTGTAATTTTCGTTGTTTGTTTTTGTTTTGAGA 16 2 0 

I I I I I I 

1621 CAGAGTCTCACTCTGTGTCCCAGGCTGGAGTGCAGTGGCACAGTCTCGGCTCACTGCAAC 168 0 



FIG. 4B 
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1681 CTCTGCCTCCCGGGTTCAAGCAATTCTCCTGTCTCAGCCTCCAGAGTAGCTGGGACTACG 174 0 

""I I I I I I 

1741 GGCGCCCCACCACCACGCCCGGCTAATTTTTTGTATTTTTAGTACAGACGGGGTTTCATT 18 0 0 

I I I I |— I 

18 01 GTGTTAGCCGGGATGGTCTTGATCTCCTGACTTGTGATCCGCCTGCCTCGGCCTCCCAAA 18 60 

| | | | | | 

18 61 GTG C TTGGATTAC AGGTGTAAG CCACCGC AC C C CG C CCAGCCTGG C AG ATTTTATTTAAT 192 0 



1921 CATTTGTAGCTTCATTTTCCTCGTCTGTCAAACAGGGATACTGTAATACAACCTCAGTGT 198 0 

| | | | | | 

1981 GTCATTGGGCAGTTTAAATGAATGTACATTCCTGAGGCATCAGAACTTTGTTCACTGTTA 2 04 0 

| | | | | | 

2 041 TATA C C C AAT G C C T AG AAG AGG AC C T G C AC AT AG C AGGTG CT C AG T AAAT G T T TG TTG AA 2100 

I I I i | ( 

2101 TGAATGATTAAGTGCATGTAAAGCATTAAGCATAGCGCCTGGCAGTAAGTGCTCAATATT 2160 

| | | | | | 

2161 ATC ACTT CT T ATATTAACACGTTTTACAT ATAAAG AAATGG AG G CAAGAAAG CATTTC C T 222 0 

| | | | | | 

2221 TTGGGGTTTAGAGCGCTTAAGTTGTTCCTCTGTTATCATGCCTGAATTCCCCCGCCCCTC 22 8 0 



22 81 AGTTAC C TGGGG AAG AGTAAAGG C AAG AATT CTTAC CAG CAT T AG T C ATA CATC C T C C T G 23 4 0 



2341 ATAGG 2345 



FIG. 4C 
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1 1 1 1 J 1 

1 GGCTGAGGCGGAGGAGCCGCCGCTTCTGACCTCGCTCTGGCTCCGGTGCGCGCGGCTGAG 60 



| I 1 | 1 I 

61 CGCGTGCGAGGCCCCGCGCGTCGGGCAGGGGCGGCGGCGGCCACTGCGCGCCGCCCTGAG 120 

| |- | | I I 

12 X GAGCGCCCCAGCGGCGGCGCGACTGCGGCTGAGGAGAGAGCCGGCTCCGGGCCTCCGCGT 180 



1 1 1 1 1 1 

181 CCTCCTGCTCCCCCGGCCCCCCGCCTCCTCGGGGGGGCGGCGGCGGCGatgttctcggtc 240 
X M F S V 4 

-| J | I I I 

241 ctctcgtacgggcggctggtggcccgcgccgtgctcggcggcctctcgcagaccgacccc 3 00 
5LSYGRLVARAVLGGL.SQTDP 24 

,: , 1 1 —i 1 

301 agggccggcggcggcggcggcggcgactacggactggtgacggccggctgcggcttcggg 36 0 
25RAGGGGGGDYGLVTAGCGFG 44 

—I 1 1 1 1 1 

361 aaggacttccgtaagggcctcctcaagaagggcgcgtgctacggggacgacgcgtgcttc 420 
45KDFRKGliLKKGACYGDDACF 64 

, | | | L 1 

421 gfcggcccggeaccgttccgcggacgtgctcggggttgcagatggtgtaggaggctggaga 4B0 
65 V ARHRS ADV I»GVADGVG GWR 84 

- I I I" I— I I 
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481 gactatggagttgatccatctcaattctcagggactttaatgcggacgtgtgaacgttfca 540 
8SDYGVDPS QF S GTLMRT CEE.L X04 

, , | I 1 I 

541 gtaaaaLgaaggacggttcgtacctagtaatcccattggaattctcaccacaagctactgt 6 00 
105 VKEGRFVPSNPIGIIjTTSYC 124 

I -I I 1 1 I 

601 gagttgctgcaaaataaagtccctttgctcggtagcagcaccgcctgaattgtggtgctg 660 
125 EXiIiQHXVPXtliGSSTACXVVXi 144 

-J , , J 

661 gacagaaccagccaccgcttacacacagcaaacctgggcgattcaggcttcctggttgtc 720 
145 D R T S H R 1« H " T A N L. G D S G F 1» V V 164 

j ! ] , , 1 

721 aggggtggtgaagtcgtgcaccgatcagatgagcagcagcattacttcaacactccattc 780 
165 RGGEVVKRS D EQQHYFNTPF 184 

| | — 1 I""- -I" 1 

781 cagctctcaatcgctccccctgaagccgagggagtcgtcttgagcgacagtccggatgct 840 
185 Q li S IAP P E A E G V V L SD S PDA 204 

I— 1 —I I """I I 

841 gctgatagcacgtctttcgatgtccagctaggagacattatcctgacggcaacagatgga 900 
205 ADSTSFDVQLGDIILTATDG 224 

J | | -1 | | 

901 ctctttgacaacatgcctgattatatgattcttcaggagctaaaaaagfctaaagaattca 960 
225 LFDNMPDVMILQELKKLKNS 244 

„. | \ ! , ^ J 

961 aattatgagagtafcacaacagactgccagaagcattgctgagcaagctcatgagctggcc 1020 
245 NYES IQQTARS lAEQAHEIaA 264 

1 1 j ( 1 1 

1021 tatgacccaaattatatgtcaccttttgcacagtttgcatgtgacaatggattgaatgtg 1080 
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265" YDPNYMS PFAQFACDNGLNV 284 

| | 1 | 1 I 

1 0 8 X agaggtggaaagccagatgacatcaccgtccttctttcaatagtggctgagtatacagac 1140 
285 R .GGKPDDITVLLS IVAEYTD 304 

, ) | | 1 | 

1X4X t. a.cy CTG AG GTGTCAAGTC CTG CCTTTC C TTTCATCAT C C C AAAT TTCCCCTGCCATGTGT 1200 

1 1 1— - I 1 1 

1201 GCTGATCCTGCTGGCAGGACCACATTTCTTTGCCACTGR.TCTCAATGGCCAGTGATGTAA 1260 



I I I 1 I I 

1261 GTCTTTTGCCTGTCTTCTTGAGACTCGTTGAGATCTTTGTTGAGAACCACTACTATCATT 1320 

| | | J 1 | 

1321 CACTAGCTCATATCTGCCGGCAGCAATTGAAGAGATCCAATATTTGAAGATTGGCCTTCA 138 0 



| | \ | | 1 

13 81 TTTCTCGATGTTCTTTCCATGATGGGGATGGAGGTGTTCAGTGCCACCGTGGCTGTTACT 1440 



— I 1 1 1 1 1. 

1441 TTTCAAAGTAGTTGAAGTATTGAAAATGAGTAATGTTGGTAAAGTGAATTCAAAATCCTA 1500 



, | | | | | 

1501 GTATGCTAAAGGGATGGTACAAGTCTAACACAAATTGTACGTAATGATACATCTACTAGA 1560 



|- 1 *— -1 I— 1 — -i 

1561 AACATACATTATTCATCAAAAGAAATGTTACATGTGTACTCCACAGGCATAGTCTTTGTT 1620 



F16-. 5 C 



Patent Application Publication Jun. 26, 2003 Sheet 21 of 55 US 2003/0119024 Al 



, 1 J J 1 1 

1621 ATGATGATTGGTGTGGCTTTATGTCTTTGTTATAAACTCCTATTTTTCAGGGGCTTATGA 16 8 O 

1 1 -—i 1 1 1 

1681 TTCTGCTCTAARACRTTGCTCTGGGTTATACAGTTTTGATCCCAftAAGCTTTTTTGTTAC 174 0 

1 1 ( 1 1 1 

1741 AAATCGGGAGAAAAATCCA.TTTTAGTTCTATGGATGGAAATATTTCATGCTTTTAAAAAG 1800 

I— "I-" I — I I I 

1801 ATGTTTGTGTTCCTGTGGTTAAAGTTTTGGCAGTTTATTGATTAGTCCAAATCACAGGCT 18 6 0 

1 1 1 j 1 | 

1861 AAGGCCTGATCTCCAGGAGGGGTAGGGGAGACACTTTACCAGTATTTTTTTATGGAAATA 192 0 

I I "I t — i I 

1921 ATACTCAAGGTTGTAAAACCCCXCAAAGCCTAGAAATTTAATTGTTATGGCTGAAATTCC 19 80 

| | | | | — \ 

1981 TCCTAGTTGTCTGATAGAATGCCCCTGAATGGGAACTCTAGGTCCCAAGGCCTGAAGGQT 2040 

__._ ( | l__.__.___J | | 

2041 TGAGAACAGACAGCTGTAACTTTGAATTTTGTTGGCTTTCAGTGGTCATGCTACCTACCC 2100 

1 1 1 1 ! 1 

2101 ATACTCGTACTCTC AGACCTTTT A'fTAGTAGCCTTGCTTTCTATAGAGCATGCACCAAAT 2160 
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1- — I-- I 1 1 I 

2X61- CCAGTGAOTCCATGTGGAGAGAGCACTGTGTGCGCAGCGGCAGCAGCACAGACGTCCATG 2220 

1 1 1 \ 1 1 

2221 AGGAAAACTCCCAGTGATGATCTGACATTTACAATTACCCCACATGGAAA.TTTAGGGGTT 228 0 

1 1 1 1 1 1 

2281 TCTGAATCAAGCTTAATGTXTACAGTTTCCAAATAGCCATTTTGCAGTGTATAGTTTCCT 2340 

J J | | | i 

2341 TACAAAACTACC C CGC ATTC AGTTTTCAC ATTATCTGCAAGCTGAAACTT ATTTTTAAGT 2400 

| | J | | | 

2401 TTTGTGTACAAGTTGACTGCTGTAAAGAT ATATATTTTTGGGTC AGTTTTTTTCCTTCAT 246 0 

1 „ 1 1 1 I 1 

2461 TAACTTGGTGGTAGAAAAAAATATATACTTAGAAATCCTTAAATTPlAAGCCATGTTTTAT 2520 

1 ! 1 1 1 1 

2521 ATft.TAAGTCAGGTAACATTGGTGTA.TAGAT(3AGAATGCAATTAAACCTGATGA.QAA.TCTA 2580 

— - I --! 1 1 --1 "I 

2581 CTTGAGAATATAGAAAGTCTTTCTCTAAAGGAGATACTGACTCCCTGGTTTATTGCATTA 25 40 

| 1 | — -| I i 

2641 AAATTTATGTTTGAGGTTACCTCAACTTGTTTTAAAAGATTTTGTTTTGTGAATTTGTAC 2700 

» 

I 1 1 1 1 1 
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2701 TGTATATTTGRGTAACTGTCAGGCTTTTATrTAAAATTGTTTAACATGTACCA.T<3TACAT 2760 



I I — "I I — -I "I 

2761 GTCATTACTATATTTCAATGC ATC ATGCTTGTAACAGGC ATTTCATTT ATAATAAGAATG 2820 



J | | | | | 

2 821 AGTTATTCATTTGTAAGCCGTTCAGTAATTTATCTACTATTCCTAAATTGGCATAATGTT 2 880 



( | , , | I 

28 81 AGATAA.TCTATTTTGAATCACCTTTAATTACATGTCAGAATGCCTTAACTACCCTAACTT 2 940 



, 1 1_ 1 :_, 

2941 GACAAAACAGAATTCTTTGGTAGACGCGGTGGGGGCGGGGTGGGGGGTCTGGACGGAGTC 3000 



i -I -I 1 1 1. 

3001 TCTATTTAAGGAGflAATCATCATGCTATGATAAAACACAGAAGCATGAGTGGCAAGTGGC 30 SO 



| J J i | 1 

3061 GGGGTATTTATTTTGCACAAACTATTTGCAGTCTCTGTGTATTTAAAAAGTAAAGAAAGT 3120 



| | | | | -| 

3121 TGCATCCAGAAGGGTTTTGTTAGAATGAATACATTTATATTAGGACTGACAACTTCAGCT 3180 



-I — I 1 1 --I I . 

3181 CTTTTGTTTAGGTTTTCAATTATTTTTGGTAAGAGTATGTAGCCTTATGATCTGGATATA 3240 



1 |____.__..| |. 

3241 TTTTGCATTCATTTTCCAACGCCTACATTTAATTCC*rGGTAAGAGCAGTGCTCGTCAAGT 3300 



Fl<3* . 5F 
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1 1 i 1 — I 1 

3 3 01 TTCTGGTTTTTCTCTGCTCTCATTTAACCCGTCAAACACAATCTTTGTAAAGCTA.GATTG 33 SO 

I" "I I I I I 

3361 GTGGTGTTTTATACAAC^TATTTACTCAGCTTACCTTTTTGAGAAACGATTGTTAGAAAT 342 0 

1 , 1 1 1 1 

3421 TGACGATGTGTTTGTTCCAGTGATACTGAAAGTAGTGGGGGCAAGAAT-XGAGTTTCACAG 3 48 0 

| |_*_ 1 | J | 

3481 TGGAATTGGCTTTGGATCTGGCCTATAGATTAGTGACATAAAATATTTTCTCTATTTTCC 354 0 

1 1 j 1 , 1 

3 541 CCTGTTCTTTTTGTGTT ATGCACTTAATTTTATGACTGCCGGGGGGGTCAGCTGG AGTGC 3600 

1 --I 1 1 1 

3601 TGCTTAACAAGTATCTCTCCTACTCTCAGTGGTCAGAGGCTGTGTTGGACCCATAGTAGA 3660 

1 , 1 ! 1 1 

3661 ATTTTCCAGGTCACAGACCCAAGCTTCCATGGGTTGTTACTGTGCTGTACCACTTGGTGG 3720 

I I 

3721 GTCTGATTCTGAACCTGATGTGTGTGTT 3748 
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j J.* 1 1_ | 1 

1 AAAATTTGCTGATTAAAtgaatgtgggtgtgtttgagagggatcctagacagccaagcct €0 
1 + MWVCLRGILDSQAF14 

| j | \ | | 

61 tc tggcatgaaacgctgagaagatgggagtgtctgctggcagagatgaaagt-gagcaggg 120 
15 WHETLRRWECLI,AEMKVSRG34 

1 I- "I I 1 I 

121 gtgagcgcagccactgcccaacgcaaaccgtgaagaagcttctggaagagcagaggcgcc 18 0 
35 ERSHCPTQTVKKI*IiEEQRRR54 

, |- | J | | 

181 gccagcagcagcagcccgacgctggcggggtgcagggacaatttctccetcccccagagc 240 
55 Q QQQPDAGGVQGQFI.PPPEQ 74 

, | ~-| | 1 1 

241 agcccctgaccccatctgtgaatgaggctgtgactggccaccctcccttcccagcacact 3 00 
75 PLTPSVNEAVTGHPPFPAKS94 

J 1 — | 1 1 I 

301 cggagac tgtgggfctctggacctagcagcctgggctttccagac tgggaccccaacacgc 36 0 
95 ETVGSGPSSLGFPDWDPNTH 114 

| | , 1 ""I 

361 atgctgcctacactgacagcccctactcttgccctgcttctgctgccgaaaatttcctgc 420 
115 A A Y T D S P Y S C P AS AAENFIi P 134 

j„„ | | j 1 i 

421 ctcctgacttctacccaccctcggacccagggcagccgtgcccatttccccagggcatgg 480 
13S PDFYPPSDPGQPCPFPQGME 154 

[ 1 |- — | — J I 

481 aggctggaccctggagagtttctgcaccccctt.caggacccccacagtt.ccccgctgtgg 540 
155 AGPWRVSAPPSGPPQFPAVV 174 
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— I — I — I — I 1— I 

541 tccctggaccatcgctggaggtgg-cccgagctcacatgctggctititggggccacagcagc 6 00 
175 PGPS LEVARAHMIjALGPQQL. 194 

— - I- 1— - i -I- --I 1 

6 01 tgctggcccaggatgaggagggggacacgctccttcacctgtttgcggctcgggggctgc 660 
195 LAQDEEGDTIiLHLFAARGLR 214 

1 j ! 1 , , 

661 gctgggcggcatatgctgcggctgaggtgctccaggtgtaccggcgtcttgacattcgtg 720 
215 WAAYAAAEVLQVYRRLD IRE 234 

, 1 |_ \ | 

721 agcataagggcaagacccctctcctggtggcggctgctgccaaccagcccctgattgtgg 78 0 
235 HKG KTPIil*VAAAANQ PL I V E 254 

, | | | | | 

781 aggatctgttgaacctgggagcagagcccaatgccgctgaccatcagggacgttcggtct 840 
255 DLL»NI*GAE PNAADHQGRSVL 274 

I | l „„! , ] 

841 tgcacgtggccgctacctacgggctcccaggagttctcttggctgtgcttaactctgggg 900 
275 HVAATYGlt PGVLLiAVLKS GV 294 

\ | i | | | 

901 tccaggttgacctggaagccagagacttcgagggcctcaccccgctccacacggccatcc 960 
295 QVDLEARDFEGLTPliHTAI L 314 

I-- I I I I I 

961 tggcccttaacgttgctatgcgcccttccgacctctgtccccgggtgctgagcacacagg 1020 
315 ALNVAMRP SDLCPRVLSTQA 334 

j [„„ 7 „„| , 1 ! 

1021 cccgagacaggctggattgtgtccacatgttgctgcaaatgggtgctaatcacaccagcc 1080 
335 RDRLDCVHMLLQMGANHTSQ 354 
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1 1 1 , , 1 

10,8 1 aggagatcaagagcaacaagacagttctgcacttggccgtgcaggctgccaaccccactc 1X4 0 
355 E X KSNKTVLiHIiAVQAANPTL 374 

, | ! , | ! 

1141 tggttcagctgctgctggagctgccccggggagacctgcggacctttgtcaacatgaagg 1200 
375 VQLLLEXiPRGDLRTFVNMKA 394 

( ^.J | [ , I 

1201 cccacgggaacacagccctccacatggcggctgccctgccccctgggccggcccaggagg 12 6 0 
395 HGNTALKMAAAIjPPGPAQEA 414 

, | , ( | J 

1261 ccatcgfcgcggcacctgttggcagctggggcggaccccacactgcgcaacctggagaatg 132 0 
415 IVRHLIiA AGADPTL.RNI.ENE 434 

! | | , , | 

1321 agcagcccgttcacctgctgcggcccgggccgggccctgaggggctccggcagcfcgttga 13 80 
435 QPVHXiLRPGPG PEGLRQLLK 454 

| |„ |— , J | 

1381 agaggagccgtgtggcgccgccaggcctgtcctcttaggACTCAAACCCAGACCCTGGAC 1440 
455 RSRVAPPGL SS* 465 

, | ) ! ( \ 

1441 TGATTTTCCAGTCCCCACCGTCCTGCGGGACAGCCAGCGTATGCTAATGTTGCAAACCCA 1500 



| I- -| l t --| 

1501 TGATAATGTATGTGGAATATCCTGCCATTGGGGTTTTACATTAAAACCCCAGAATGGCTG 156 0 



I I — i I -I I 

1561 CAGAGGGGTGAACAGGCCCCAATATTTGGGGTGCTGTGATACCCCTCTTCTACCCACAAG 1620 



, 1 ! 1 1 -J 

1621 GAGCCCTCTTGATGATTTCTGTGAAATCGAGGCCCCTTGATTGTTTCTGTGAAACACCCT 1680 



! 1 1 -| 1 

1681 G CAC C CCTAGTC CTTTCCCCACTGAGATCTTTCGGGTT CT CT C CC CTAACTCAGCT 173 S 



f 1 I6r. <r C 



Patent Application Publication Jun. 26, 2003 Sheet 28 of 55 US 2003/0119024 Al 



1 1 1 1 1 1 

1 TTCGCCGGAGCGCGACCCGGGGACTCCCAGGCCTGTGGGCGGGCCCTGCCCAGGACTGGG 60 

, 1 1 1 1 1 

61 CGGTGCCATAACCCCTAGTTTAAATVACTCGCGGGTACCGGACCCftAGJVTCGGGGACCCGG 120 

1 1 1 1 1 1 

121 CGGCGG CT CCG CGGG GG AAAC AG CG AGGCT GG CGC AG CGCCG AGGCCG CGGCCCTGGGGG 180 

j 1 1 1 1 1 

181 CCCGCAATCCACGCCACGGAATCCCCGAGTGAGCAGGGGTGAGCGCAGCCACTGCCCAAC 240 

j 1 1 1 1 1 

241 GCAAACCGTGAAGAAGCTTCTGGAAGAGCAGAGGCGCCGCCAGCAGCAGCAGCCCGACGC 300 

1 j 1 ! 1 1 

301 TGGCGGGGTGCAGGGACAATTTCTCCCTCCCCCAGAGCAGCCCCTGACCCCATCTGXGAA 360 

j { 1 j 1 { 

361 TGAGGCTGTGACTGGCCACCCTCCCTTCCCAGCACACTCGGAGACTGTGGGTTCTGGACC 420 

! (-_— — j 1 1 1 

421 TAGCAGCCTGGGCTTTCCAGACTGGGACCCCAACACGCATGCTGCCTACACTGACAGCCC 480 , 

, 1 1 1— 1 1 

481 CTACTCTTGCCCTGCTTCTGCTGCCGAAAATTTCCTGCCTCCTGACTTCTACCCACCCTC 540 



£ \ G. "7 A 
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1 1 1 1 1 1 

541 GGACCCAGGGCAGCCGTGCCCATTTCCCCAGGGCa.tggaggctggaccctggaaragtttic 6 00 
1 ME.AGPWRVS 9 

, | , , J J 

601 tgcacccccttcaggacccccacagttccccgctgtggtccctggaccatcgct-ggaggt 660 
10AP PSGPPQFPAVVPGP SLEV29 

| 1 | 1 | 1 

661 ggcccgagctcacatgctggctttggggccacagcagctgctggcccaggatgaggaggg 72 0 
30ARAHMLALGPQQLLAQDEEG49 

, | , , , | 

721 ggacacgctccttcacctgtttgcggctcgggggctgcgctgggcggcatat.gctgcggc 7 80 
50DTLLHLFAARGLRWAAYAAA69 

| | i | I" "1 • 

781 tgaggtgctccaggtgtaccggcgtctt.gacattcgtgagcataagggcaagacccctct 840 
70 EVLQVYRRIjD IREHKGKTPIj 89 

! 1 , 1 1 ( 

841 cctggtggcggctgctgccaaccagcccctgattgtggaggat.ctgttgaacctgggagc 900 
90 LVAAAA NQPL I V E D t. L N I* G A 109 

1 j ! , ! 1 

901 agagcccaatgccgctgaccatcagggacgttcggtcttgcacgtggccgctacctacgg 96 0 
110 EPNAADHQGRSVLHVAATYG X29 

J | | 1 | f 

961 gctcccaggagtt.ctcttggctgtgcttaactctggggtccaggttgacctggaagGcag 1020 
130 LPGVLLAVLNSGVQVDLEAR X49 

I ""I I"' "I I 1 

1021 agacttcgagggcctcaccccgctccacacggccatcctggcccttaacgttgctatgcg 108 0 
150 DFEGLTPLHTAILAtiKVAMR 169 



Fi*. IB 
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, -| — , 1 | 

1081 cccttccgacctctgtccccgggtgctgagcacacaggcccgagacaggctggattgtgt 114 O 
170 P SDIaCPRVLSTQARDR LDCV 189 

I { [ , , , : 

1141 ccacatgttgctgcaaatgggtgctaatcacaccagccaggagatcaagagcaacaagac 1200 
190 HMLLQMGANHTSQE IKSNKT 209 



I ""I I I I ""I 

1201 agttctgcacttggccgtgcaggctgccaaccccactctggtt.cagctgctgctggagct: 1260 

210 VLHLAVQAANPTIiVQIiliXiEX. 229 

1 — I 1 1 1 1 

1261 gccccggggagacctgcggacctttgtcaacatgaaggcccacgggaacacagccctcca 1320 
230 P RGDLRTFVNMKAHGNTALH 249 

1— I — I I-- I --1 

1321 catggcggctgccctgccccctgggccggcccaggaggccatcgtgcggcacctgttggc X3 80 
250 MAAALPPGPAQEAIVRHLLA 269 

| , | 1 1 i 

13 81 agctggggcggaccccacactgcgcaacctggagaatgagcagcccgttcacctgctgcg 1440 
270 AGADPTLRNtiENEQPVHIiIiR 289 



l r 1 I I | | 

1441 gcccgggccgggccctgaggggctccggcagctgttgaagaggagccgtgt.ggcgccgcc 1500 
290 PGPGPEGXjRQLIjKRS RVAP P 309 

( | 1 | | | 

1501 aggc C t g t c c fcc 1 1 aggACTCAAACCC AGACC CTGGACTGATTTTCC AGTCCCCACCGT C 1560 
310 G L S S * 313 

| | t | I—-' 1 

15 SI CTGCGGGACAGCCAGCGTATGCTAATGTTGCAAACCCATGATAATGTATGTGGAATATCC 1620 
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, j 1 -| 1 \ 

16 21 TG C CAT TGGGGTT TT AC ATT AAAA C C C C AG AATGG CT G CAG AG GGGTG AACAGGCCCCAA 1680 

1 1 , , 1__ j 

16 81 TATTTGGGGTGCTGTGATACCCCTCTTCTACCCACAAGGAGCCCTCTTGATGATTTCTGT 1740 

1 1 1 , j 1 

1741 GAAATCGAGGCCCCTTGATTGTTTCTGTGAAACACCCTGCACCCCTAGTCCTTTCCCCAC 1800 

, 1 

18 OX TGAGATCTTTCGGGTTCTCTCCCCTAACTCAGCT 1834 



f\€r, ID 
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1- — 1- -I 1 1 -I ■ 

1 TAAC GAG CTTCCTT C C AC CGCAAAAG AGCTGG AG AACAATG CTAGG C AACGT GCTGG AG A 60 



-— I 1 i 1 1 1 

61 CCTTGGCCCTCGGAACCCAAaTCACSCCTCCCATGTGAGCTCTGGAGGGAGAACTTTatg 120 
1 Ml 

1 1 — | 1 1 1 

121 tgttgcactgagggcagfccfcccggaaacgcgattcgcagcgggcgccggaagcggtgtfcg 18 0 
2CCTEGSLRKRDSQRAPEAVL 21 

, | | | | ! 

131 tgtctgcagcfcctggcagaggactgfctccactagacacgctgaagggactgggtacgfcg^ 240 
22CLQLWQRTVPLDTLKGLGTC 41 

, 1 ! 1 1 1 

241 tfctccttcaggaccagagctgagaggagctgggafccgcggcggcaafcggaacgggcctca 300 
42FPSGPEIiRGAGIAAAMERAS 61 

I -I I I I I 

3 01 gaaaggcgcacggccagcgcgc ttt fctgcgggg t tccgggcct tgggactt t tcagcaac 360 
62ERRTASALFAGFRAI.GLFSN 8X 

1 , , 1 

361 gacattccacacgtggtgcggttcagcgcgcfccaagcgccggfctctatgtaacaacctgc 420. 
82 D I PHVVR F SAL KRRFYVTTC 101 

:.„i 1 1 1 1 1 

421 gtgggcaagagtttccacacctatgacgttcagaaact.t.agtctggttgcagtaagtaat 4 80 
102 VGKS FHTYDVQKliSXiVAVSN 121 
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1 1 1 1 J 1 

481 tctgttccacaggatatctgctgtatggcagctgatggcagattagtctttgctgcttat 54 0 
122' S VPQDI CCMAADGRLVFAAY 141 



J | [ | | | . 

541 ggaaatgttt.tctctgcattt.cfcccgtaat:aaagagatagtacatacctttaagggtcat 600 
142 GNVFSAFARNKEIVHTFKGH 161 



1 | 1 | I 1 

601 aaggcagaaatccatttcttgcaaccctttggagaccacattatctctgttgatactgat 660 
162 KAEXHFLQPFGDHIISVDTD 181 



I— | | \ J | 

661 ggcattcttattatttggcacatatattcagaagaagaatacctgcagttgacttttgat 72 0 
182 GILIIWHIYSEEEYIiQIiTFD 201 



721 aaatcagtatttaaaatttctgcaattttgcatccaagtacctacttgaataaaatactt 780 
202 KSVFKX SAIX.HPSTYLNKII< 221 

( | | I- | i 

781 ctgggcagtgaacaaggaagcctgcagttgtggaatgtaaaatccaataaacttctatat 840 
222 LGS EQGSLQLWKVKSNKXiLY 241 



, —| [ [ , J 

841 acatttccaggatggaaagttggagtgacagctcttcagcaggcaccagccgtggatgtt 900 
242 TFPGWKVGVTALQQAPAVDV 261 

| | | \ , | 

901 gttgctattggtcttatgtcaggtcaagttatcattcacaacattaaatttaatgaaaca 960 
262 VAIGLMSGQVI IHNIKFNET 281 

™l 1 1 1 ! i 

961 ttaatgaagtttcgtcaagactggggacccattacttcaatttcatttcgcacagatggt 1020 
282 LMKFRQDWGPITSISFRTDG 301 

1 1 1 1 1 -I 
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1021 catccagtaatggcagctggaagcccatgtggccatattggactctgggatctagaagac 1080 
302 HPVMAAGSPCGHIGLWDIiED - 321 

| J | 1 | | 

1081 aaaaaattaatcaaccaaatgagaaatgcacactctacagcaattgccggactgacattfc 1140 
322 KKL INQMRNAKSTAIA GLTF 341 

1 h 1 ! ! 1 

1141 ctccatagagagccacttcttgtcacaaatggcgctgacaatgctcttaggatatggata 1200 
342 L. H R E P L.LVTNGADNAL RXWI 361 

| | | , l ( 

1201 tttgatggtcctacaggt-gaaggccgacttttgagattcagaatgggtcatagtgctcct 12 60 
362 FDGPTGEGRLLRFRMGHSAP 381 

! I | | | | 

1261 cttaccaatatcagatattatggacagaatggacagcagattctaagtgcaagtcaagat 1320 
382 LTNIRYYGQNGQQILSASQD 401 

] | J- ( | | 

1321 ggaactcttcagtcattttccacggtacatgaaaaattcaataagagcttgggacatgga 1380 
402 GTLQS FSTVHEKFNKSLGHG 421 

J | | | | J 

1381 ttaataaataaaaagagagttaaacgtaaaggacttcagaataccatgtcagtgagactt 1440 
422 liINKKRV' KRKGXtQNTMSVRI* 441 

! | [ , ! 1 ■ 

1441 ccacccatcacaaagtttgcagcagaggaagctcgtgaaagtgactgggatggtatcatt 1500 
442 PPITKFAAEEARESDWDGII 461 

, | | I | I 

1501 gcttgccatcaaggtaagctatcttgctcaacctggaattatcagaaatctacaataggc 156 0 
462 ACHQGKIiS CS TWNYQKSTIG 481 

, t -\-| , ! | 

1561 gcttactttctcaagccaaaagagttgaagaaagatgacataactgcaacagcagtggat 1620 
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482 AYFLKPKELKKDDITATAVD 501 

I | | , , , 

162X ataacttcttgtgga.aactttgctgtaattggcctctcatcaggaactgtagatgtatat X6 8 0 
502 ITS CGNFAVIGLSSGTVDVY 521 

! , , , , 1 

16 81 aacatgcagtctggcatacatcgaggaagttttggcaaggatcaagctcacaagggatct 1740 
522 NMQSGXH .RGSFGKDQAKKGS 541 

, | | | | | 

1741 gttagaggtgtcgcagtggatggattaaaccagttgacagttaeaactggtagtgaagga 18 00 
542 VRGVAVDGLNQLTVTTGSEG 561 

, , , , 

1801 ttactcaaattctggaactttaaaaacaaaattttaatccattctgtgagcctcagttca 186 0 
552 LLKFWNFKNKILIHSVSLSS 581 

| | |- , , | 

18 61 tctccaaatatcatgttgctacatagggacagtggcattctgggactcgccttggatgac 1920 
582 S P N IM IrLHRDSGXIiGLAliDD 601 

„_| | | | | I 

1921 ttctccattagtgttctggacatagaaactaggaagattgtcagagagttttctggacac 1980 
602 FS I SVLDIETRKXVREFSGH 621 

, :j 1 1 1 1 

1981 caaggccaaataaatgacatggcttttagtcctgatggtcgttggttaataagtgctgcg 2040 
622 QGQ INDMAFSPDGRYlIalSAA 641 

| | | I | I 

2041 atggattgctctattaggacttgggaccttccttctgggtgcGttatagactgctttttg 2100 
642 MDC SIRTWDLPSGCLIDCFL 661 

, 1 I "| I 

2101 ttggactcggctcctctcaatgtttctatgtctcctactggagactttctggcaacttcc 2160 
662 LDSAPLNVSMSPTGDFLATS 681 
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• 1 1 [- -- J 1 1 

2161 catgtggaccaccttg-gaat.ttatctat.ggtccaatatititccctgt:at:t:cagt:tgttt.ca 2220 
682 HVDHLGIYIiWSNISLYSVVS 701 

, ( 1 , , 1 

2221 ttacggccaGttcctgcagattatgtcccttcaatagtcatgcttcctggtacttgtcaa 22 80 . 
702 LRPIiPAQYVPSIVMLPGTCQ 721 

( 1 | | — I | 

22 81 acccaagatgtagaag t atcagaagaaacagtagaaccaag tgatgaa t tgatagaatat 2340 
722 TQDVEVSEETVEPSDEIj I E Y 741 

I I"" i 1 I I 

2341 gattcgccagaacagttgaatgagcaattggtgactctttcacttcttcctgaatcacga 24 00 
742 DSPEQLNEQLVTLSLLPESR 761 

| | | J | | 

2401 tggaaaaaccttcttaaccttgatgttattaagaaaaagaataaaccaaaggaaccaccc 2460 
762 WKNLLNLDVIKKKNKPKEP P 781 

J I | 1 J I 

2461 aaagtacccaaatcagcaccatttttcattccaacaattcctggccttgtacccagatat 2520 
782 KVPKSAPFFIPTIPGLVPRY 801 

| -| ( ! | 1 

2521 gctgcacctgaacaaaataatgatccccagcagtctaaagtggtaaatct.tggagtttt.g 2580 
802 AAPEQNNDPQQSKVVNL.GVI. 821 

I 1 1 1 \ 1 

2581 gctcaaaaatcagatttctgcttgaaacttgaagaaggactggtaaataataagtatgac 2640 
822 AQKSDFCLKXiEEGLVNNKYD 841 

| | |— J 1 I 

2641 actgctctcaaccttctgaaagaatcaggcccatcaggaattgaaacagagctgcgaagc 270 0 
842 TALNI.LKES*GPSGIETELRS 861 



F/6-. 2>€ 
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1 1 _--_( „_., 1 1 

27 01 ttgtctcctgattgtggtgggtccatagaagttatgcagagcttcttgaaaatgattggg 2760 
862 LSP pCGGSIEVMQSFLKMI G 881 

, | , , J |- 

2761 atgatgctggacagaaagcgtgattttgagttagcccaggcataccttgcattgtttcta 282 0 
882 MMX.DRKRDFELAQAYLALFI, 901 

, 1 | , , | 

2 821 aagttacaccttaaaatgcttccttcagagccagtactcctagaagaaataacaaatttg 288 0 
902 KIiHLKMIiPSEPVLIiEEITNXi 921 



| J , , J | 

2881 tcatcccaggtggaagaaaactggacccatttgcaatcactcttcaatcaaagcatgtgt 2940 
922 SSQVEENWTHliQSr-PNQSMC 941 

, _| j , 1 

2 941 attttaaattatctcaaaagtgctttgttgtaaaAATAAATTTGTGACTAAACAAAGACT 3 000 
942 ILNYLXSALL + 951 

T "l |— I" 1 

30 01 TTCATATTAAATGGGTTCAATTGAACTCATTTCTTATTTTCCAAGTGTC 3049 
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1 1 , , 1 

1 AGATTT AAGT AAGT CTT C CCC AACACCGAATGGGATTCC ATCTT CAG AC CCAG CCAGCG A €0 



! „ M , , , , 

61 TGGCatggaccccttccatgcttgcagtattcttaagcaactcaaaacaatgtacgatga 120 
1 MDP FHAC S X IjKQIjKTMY DE 19 

| , , | I— -| 

121 aggacagtfcgacagacattgtagtggaagtggatcacgggaaaacattttcctgtcatag 160 
20 GQLTDIV VEVDHG K - T F S C H R 39 

, , ! , j 1 

181 aaacgttcttgctgcaatcagcccttacttcagatccatgttcactagcggccttacaga 240 
40 N V L A A I S PY FRSMFTSGIjTE 59 

, j \ j , 1 

241 aagtactcaaaaagaagttcgaatagttggtgttgaagctgaatcgatggatttagtgtt 3 00 
60STQKEVR.IVGVEAESMDIjVL79 

! 1 | | 1 | 

301 gaactatgcctacacttccagagtbattcttacagaggccaatgttcaagccttgttcac 3 SO 
80NYAYTS R V ILTEA NVQALFT 99 

, ! | | , 1 

3 61 tgcagctagcatct.tccagattccttccatccaagaccaatgtgctaagtatatgat:cag 420 
100 AAS I FQI PSIQDQCAKYMIS 119 

| , | | | , 

421 tcatttggacccacagaattctattggggtctttatctttgctgatcattatggtcatca 480 
120 HLDPQNS IGVFIFADHYGHQ 139 

I I -I — 1 1 1 

481 ggaactcggagatcgatcaaaagaatacattcgtaaaaagtttctgtgtgtcaccaaaga 540 
140 ELGDRS KEYXRKKFLCVTKE 159 
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— -I I 1 1 1 1 

541 acaagagtttctccagttgacaaaagaccaactgataagtatactagacagtgacgattt 600 
160 QEFLQLTKDQLJS ILDSDDL 179 

1 , 1 1 , ! 

601 aaatgtagaccgagaagagcatgtttatgaaagcattataaggtggtttgagcatgaaca 6 60 
1B0 NVDREE*- : HVYESIIRWFEHEQ 199 



„- | , | , | 

6 61 gaatgaaagagaagtgcaccttccagaaatttttgctaaatgcatacgttt.tcctct.gat; 72 0 
200 NEREVHLPEIFAKCIRFPLM 219 

| |. , , , , 

721 ggaagatacctttatagagaaaattccacctcagtttgcacaggctatagccaaaagctg 7 80 
220 EDTFIEKIPPQFAQAIAKSC 239 



781 tgtagaaaagggaccatccaacaccaatggccgtacacagaggcttggaatgactgcttc 84 0 
240 VEKGPSNTNGCTQRLGMTAS 259 

, | , , | | 

841 tgaaatgatcatatgttttgatgctgcccacaaacactcaggaaagaagcaaacagtgcc 9O0 
260 EMI I CFDAAHKHSGKKQTVP 279 

( , ! , | 

901 ttgtctagatatagtcacaggaagggtgtttaaactatgcaaaccaccaaatgacctgag 960 
280 CLD I V T GRVFKXjCKPPNDLR 299 



, | | , | 

961 agaagt tgggat tct tg ta tcaccagataatgacatt tacat tgcaggagggtacaggcc 1020 
300 EVGILVSPDNDIYIAGGYRP 319 

I I" "I I I I 

1021 aagcagcagtgaggtctccatcgaccataaggcagaaaatgatttctggatgtatgatca 1080 
320 SS SEVS I DHKAEND FWMYD H 339 

| | , „, | 

1081 ttccaccaatagatggctatccaaaccatccttgcttcgagccagaataggctgcaaact 114 0 
340 STN RWL SKPS LLRARIGCKL 359 



F<6-. <\e> 
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1 , J , , 1 

.1141 tgtctattgctgtggtaaaatgtatgcaatcggaggtcgtgtttatigaaggtgatgggag 1200 
360 VYCCGKMYAIGGRVYEGDGR 379 

, 1 , 1 , 

12 01 aaactcactaaaatctgttgagtgctacgacagtagagagaattgttggacgactgtttg . 126 0 
380 NSLKSVECYDSRENCWTTVC 399 

, , J I t J 

1261 cgcgatgccagttgcaatggaatttcataatgctgtggagtacaaagagaagatctatgt 1320 
400 AMPVAMEFHttAVEYKEKIYV 419 

| 1 | | 

1321 tttacagggagaattt:tttctctitcfcatgagcctcaaaaagactact:ggggt.tt:ct:taac 13 80 
420 LQGE FFLFYEPQKDYW G FLT 439 

| | , , | 1 

13 81 ccccatgactgtgcctagaatccagggcttagcagctgtatacaaggactctatctacta 144 0 
440 PMTVPRIQGLAAVYKDS IYY 459 

| [ | , j | 

1441 cat:agctggaacctgtggaaatcatcaacgtat:gtttiactgt:agaagGCt:atgatat.tga 1500 
460 IAGTCGNHQRMFTVEAYDIE 479 

l , J 

1501 gctaaataaatggactcgtaagaaagactttccatgtgatcagtccataaatccatacct 1560 
480 LNKWt'rKKDFPCDQS INPYL 499 

| | J | | _--| 

1561 taaactggtacttttccagaacaaactccatttatttigtticgagctacticaagtigactigt 162 0 
500 KL1VX1FQNKLHI1FVR AT QVTV 519 

, | , , , | 

1621 .tgaagaacacgtcttcagaaccagcagaaaaaafct ccctt taccaatatga tgacattgc 168 0 
520 EEHVFRTSRKNSLYQYDDIA 539 

1 1 1 1 1 1 

1681 tgaccagtggatgaaagtgtatgagaccccagatcggctctgggaccttggccggcatfct 1740 
540 DQWMKVYETPDRL.WDL.GRKF 559 
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, 1 1 1 | 1 

1741 tgaatgtgctgttgctaaactgtatcctcagtgtcttcagaaagtactictaaATGAGTAG 18 0 0 
550 ECAVAKLYPQCIiQKVL,* 575 

, | I | | 1 

1801 CAGGCCTTAGTGCATCACTGGCATCTCATTCTTAGGAAACTTGTCTTTGATACAAAAGAG 1860 

1 1 , , 1 1 

1861 TG CTG AC AGT ATTTC AG AAAGC T GAGAG AGTTTT AT ACATGG AAAATGGGT ATGCTT AAA 1920 



, 1 j 1 1 1 

1921 GATTGCAGGGTAGGGAGGGATTTTCCTTCATCCTTGTGAC ATTTC ATTTCAGTAAGGAAA 1980 

, 1 1 1 1 1 

, 1 _|_-_ -| 1 1 

2 041 TTATAA.TTTTGTGA.TAAATAGTAACTGAGGTACCAGA.TGJVATCA.GGACAACTATGCACTC 2100 

| 1— - t — I- I" — I 

2101 TTATAAGAGCPiTTTAGGGTATTATTGGGTAAAGACGTCTAAACTTGTTTGATGTGACTTT 216 O 



I I I I I — "I 

21S1 TAATTTTAAR.TACGGGTAACAATCTGAGGCAATATCACTAGGACTTTAGCTGTGACCTCT. 2220 



, | | , | | 

2221 CTAACACAGAGAAGCACTAACTTAGATCCTCATTCTTAATATTTATATGTATCTATTTTT 22 80 



,. 1 1- I 1 

2281 GTGTACTGTTTTCAAGTGTACTG AGATTTAAATGTGTTCTATTATTAGAGTAGATCGAAG 234 0 
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1 ----1 1 1 1— i 

2341 AAAAAATTAGTCTCAGAAA.GAGCTTTTAGTCTGATTGTTTCCATTTCCCATGTAATTXTA 2400 

— -I" I" I I — I" 1 

2401 AGTTAAGCTAAAGTTTTAAAGTGGCAGTTTTCTGTCGATGACTTTTTCAAGXGCTAACAC 245 0 

| -..-j | | | 1 

2461 TGTCTCTTTTGTGAAAATCTGGAAJUVGTGCTCATATTCACAGGTGGCTGGTGCTAGTCTA 2520 

- —I 1 1 1 1 1 

2521 ACTTAATTCATGTGTATAACTAGATGGATTTAAATGGTCTGAGCCTATGCCTATCTTTCA 258 0 

| | | t | [ 

2581 AATTGGTGTGGATTTC ATGGCCATAGT ACTTT ACCTGTTGAACTCTTGTG ATTTCACAAG 264 0 

— -I --1 1 I — -I 1 

2641 ATTCTCTACTTATGTGATAGGAGGGTATGGCCAGTTATTCATCTAACTGGACTCAftTCT'X 2700 

t i | | | 1 

27 01 AG AAT AGT AGGAAC ATT AT ACCC AGTTTGC ACT AACATGGGCCATTTGT AG C C CAACCTT 2 760 

j 1 1 -j 1 :— i 

27S1 CTCTTCCATCTACCTGTCCATTCATTATTGGTACAAGGAAAGGTAACTTATTTCTCTTCT 2820 

J | | | 1 I 

2821 GCACAGAGCATAATGTGAAGTTTTATACCTACTTTTAAAATTCTGCTTTCCAGAAACAAA 2880 

1 ( 1 — | 1 1 

2881 ATTCCTGCAGTGGTCTAATTTAATGTCTTTAAGTTTCATATTACAATTAAAACCTCATTT 2940 



Fig. <H 
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, j 1 1 1 1 



j 1 , , , 

3001 GCTGAAGGTGA.TTGAAAAGGAAAAGAGTGAGTGRACA.GAA.CCATAGCTTTCTR.GGTACTA .3 060 



I" — I 1 I I- ""I 

3 061 AAGC ATTTTTTGCATTT AACTG ATGAAATTTCTAAC AATCATCAGTTAGGAAT ATTAACA 312 0 



1 1 ( | , 1 

3121 TGAAGGATAAACCAACTTATTTGTATACCTAAGGCAGGCATTTGGATCAGTAACATGT"rT 3160 



1 — I 1 1 1 ! 

3181 TACTAAGCCTAGAGTAATTCGTAAAGGGTATAAGCATAGGACAGATTTTGCCCTCAATCA 3240 



, | , , __._l_.__ | 

3241 CAATATTTGTATTCACTTGAAAGCAAACTGGCATGGTTCGTATTTTAAAAATCTTGCACA 3300 



[ , ! ! 1 1 

33 01 AATTGTAATGTGATACTGTGAAACAAATTGAAAACATTGCCTCTTTGCATCACATACCTC 336 0 



""I ----j [— | | 

3 361 GTTTTTCAG AAACTTTCCAAACTGCTTTAC ATAGACCTCT ACAAGTAGGGAATGTTTTCT 3420 



| | | 1 1 | 

3421 GAAGCAGAAGTTAAAATGGACAGCATTTCTAGAATTAACATTTTAAAATCTAGTCTTAGC 34S0 



| | | 1 | 1 

3481 TAGATATGTGGTTTCTTCTTATTGGTGTTGATAGTATGTCTGTAATCTCTGTATAAACTT 3 540 



Fig. 9f 
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1 1 1 1 1 1 

354X TGTCAACATTTTTACCTCCCCAGTTTTATCTTCTGTTTTGTTTTTGTTTTTATCATCATG 36 0 0 



1 1 —i r 1 1 

3601 ATGTTTTGG AGTT ATT ACTGTGTATTTT AG AAATCATTCTTTACAGTTTTGC ATTGCTGA 3 S 6 0 



- --I ;-l 1 1 1 1 



3723. TGTTGAATAAGATTTCCTGCCGCTTTTTGACAATCTTGTGTATTTAGAAA3UVTGT ATT AC 3780 



- -1 1 1 1 1 1 

3781 TTGAAAACATGACATAGAACATTGAGTTAGCAATTTACATGGGCTGTATGTTATATAAGA 3840 



— " I I"" I— I I I 

3841 GPJVTGACATACTGTGGCTAATTCAACAGTAGATTTATTCTTTTAGCCTGCACAACAGTTG 3900 



| | , | j 1 

3901 ATCTTTTGGCTATGACAATTTGTATGGAGGGTACGATCTAAGTTAAGTGTGTCAAAAGCA 3 960 



1 i — -t -I—- i 1 

3961 AGGCTTAGGATTTGTTATGGGAGTAGAATAT ATATTGAATTTTGTATGAAGAACTATTTG 4020 



I I— " I" "I I I 

4021 TTTAAATTATATAGCTGGGATATTTTGCCACTGTTAAAATGGATTCAGAR.GAGGTCCTAG 4080 



— - I 1 I I" I 1 

4081 AAAAGTAAGATTAGTGACATGTGTGGGTTTATATTTAGATATTTAAGGTGCATTTTCATA 414 0 
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1 1 1 j 1 ---I 

4X4 L GTGTGGTAAGACCTTAAGTAAAAGGCACAATGGGTACTACAGAATTAAAATGTAGGTCTA 42 00 

1 1 1 1 1 1 

4 201 ACATAATGCCAGTTCCACTTTAACTTTGTTTTTGCAXTTQAAGAATGTATGTAGCACTTT' 4 2 GO 

-- I — — I 1- 1 1 i 

42 61 C CT AT AT ATTTGT C ACAC ATTG AAAACTGG ACTGGGTAT AACT ATGTT AT AGGAAAGT AG 432 0 

I 1 1 --I 1 1 

4321 AAATTGTATTCTTTATTTTCCATCTTTGTTTTCTGTTCTA.CAAAGTTGA.TQCTTAAGCAT 4380 

j | | | | 

43 81 C AAGCTGATTTTATTGGTCATGAG AACAAATGGATGTGATCATGAAGGAATCAGATTCCC 4440 

1 \— 1 j 1 1 

4441 TATGTAAAGCAGTTTAAAA.TGGAATTCAATGTTCAGTGCTCAGGTATGTAGTAA.GTACTG 45 00 

"I I "I 1 I 1 

4 501 TAGTCCTGTGGGGGCAAATGTGT AG ATATTTTTAAACATTTTGCCAT AATTGC ACAATTT 4 560 

1 I I I I 

4561 TTTGCATTTTTACCTGATGTCATTGTTTCTTATAATAAAACCTTTTCTGATTGAAAA 4617 



Fi*. 
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j 1 1 1 1 1 . 

1 CTGGAGACTGGAAGGTCCAAGATCAAGATACTACAGATTTGATTTCTGGACGTTGAACAT 60 

; 1 1 1 1 1 

61 GGTGTAGGAGTAGAAAAG$AACAGGGACGGAAGGAGAGAACTTACCCCTTCAAGCCCTTT 120 

! 1 j ._, 1 1 

121 TAT AAGGC ACTAAATCCC ATCATT G AGGGCAGAGTCCTG AT AG CCT AATC ACCTCCT AAA 180 

1 1 1 1 1 1 

181 TGCTCCATTTCTTAATATTGTTGCACTGAGGATTAAGCTTCAACATGAATTCTGAAGAGG 240 

1 1_: j 1 1 1 

241 AC AG AAAC AT CC AAACCATAGCAGTCAATGCCTTAGCCCTTG ATGTTGCT AT CAACCTGA 300 

1 1 1 1 1 1 

301 GATTCGGGGATCAAGGAAGGACAGGTAATAGTTAACCTCTTCTGTGAGAAGTCAGAAGGT 360 

j j 1 1 i i 

361 GATCTCTTTAATGCTTTCTTTTTAAGAATTTTTCAAATTGAGACTAATTGCAGAGGTTCC 420 

1 -—j j 1 j 1 

421 AGTT G ACC AGCATTCAT AGG AATGAAG AC AAACACAG AGATGGTGTGT CT AAGAAACTTC 480 

! —j 1 1 1 ! 

481 AAAAGGTGTAGACCTCCTGACTGAAGCATATTGGATTTATTTAATTTTTTTCACTGTATT 5 40 

1 , , 1 , _.| 

541 TCTGTCCTCCTACAAGGGAAAGTCatgattacactaactgagctaaaatgcttagcagat 600 
1 MXTLTSX.KCZ.AD'12 

, 1 , , , 1 

601 gcccagtcatcttatcacatcttaaaaccatggtgggacgtcttctggtattacatcaca 660 
13AQSSYHILKPWWDVFWYYIT 32 

, 1 1 1 1 1 

661 ctgatcatgctgctggtggccgtgctggccggagctctccagctgacgcagagcagggfct 720 
33 LIMLIjVAVtiAGALQLTQSRV 52 

1 1 1 j 1 ( 

721 ctgtgctgtctt.ccatgcaaagtggaatttgacaatcactgtgccgtgccttgggacat:c 780 
53 LCCIjPCKVEFDNHCAVPWD I 72 
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7 81 ctgaaagccagcatgaacacatcctctaatcctgggacaccgcttccgctccccctccga 84 0 
73 LKASMNTSSNPGTPIj PLPLR 92 

| 1 | 1 | | 

841 attcagaatgacctccaccgacagcagtactcctatattgatgccgtctgttacgagaaa 900 
93IQNDLHRQQYSYI DAVCYEK 112 

| | | | | I 

901 cagctccattggtttgcaaagtttttcccctat.ctggtgctct:tgcacacgctcat:ct:tt: 96 0 
113 QLHWFAKFFPYLVLLHTLIF 132 

I T | | , , , 

961 gcagcctgcagcaactttcggcttcactaccccagtaccagttccaggctcgagcatttt 1020 
133 AACSNFWLHYPSTSSRLEKF 152 

| , , , , J 

1021 gtggccatccttcacaagtgcttcgattctccatggaccacccgcgccctttcagaaaca 10 80 
153 VAIIiH KCFDSPWTTRALSET 172 

| | j , I 1 

10 81 gtggctgagcagtcagtgaggcctctgaaactGt.ccaagtccaagat.t.tt:gct.ttcgtcG 1140 
173 VAEQSVRP LKLSKSKIIil-S S 192 

1 1 ! 1 i 1 

1141 tcagggtgttcagctgacatagattccggcaaacagtcattgccctacccacagccaggt 12 0 0 
193 SGCSADIDSGKQSL.PYPQPG 212 

| | , J , | 

1201 ttggagtcagctggcatagaaagcccaacttccagtgtcctggacaagaaggagggtgaa 126 0 
213 LESAGIESPTSSVLDKKEGE 232 

, ! „| | ( 

1261 caggccaaagccatctttgaaaaagtgaaaagattccgcatgcatgtggagcagaaggac 1320 
233 QAKAIFEKVKRFRMKVEQKD 252 

J ! | ) | t 

1321 atcat:t.tat:agagtatatctgaaacagataatagtcaaagtcatt:t:tgfctt:gtgct.catG 1380 
253 I IYRVYIiKQIIVKVI Ii F V L. I 272 

, - H | ( , | 

1381 ataacttatgttccgtattttttaacccacatcactcttgaaatcgactgttcagttgat. 1440 
273 XTYVPYFLTHITLEIDCSVD 292 

1 ! 1 1 ! 1- 

1441 gtgcaggcttttacaggatataagcgctaccagtgtgtctattccttggcagaaafcctfcfc 1500 
293 VQAFTGYKRYQCVYS LAEIF 312 

„„| | | | | I 

1501 aaggtcGtggcttcattttatgtcattttggttatactttatggtctgacctcttcctac 1560 
313 KVX.ASFYVILVILYG LTSSY 332 

| | | | | 1 

1561 agcctgtggtggatgctgaggagttccctgaagcaatattcctttgaggcgttaagagaa 1620 
333 SLWWMLRS S I* K Q Y S F EAIjRE 352 

( | -\-_| , I | 

1621 aaaagGaactacagtgacatccctgatgtcaagaatgactttgcGttcatccttcatctgr 1680 



F 16. IDE 
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353 KSNYSDIPDVKNDFAFILHL 372 

| | | I , | 

16 81 gctgatcagtatgatcctctttattccaaacgcttctccatattcctatcagaggtcagt 174 0 
373 ADQYDPLYSKRFSIFLSEVS 392 

J | , t ,„.J , 

1741 gagaacaaactgaaacagatcaacctcaataatgaatggacagttgagaaactgaaaagt 18 00 
393 ENKLKQINLNNEWTVEKLKS 412 

| | , , , | 

18 01 aagcttgtgaaaaatgcccaggacaagatagaactgcatctttttatgctcaacggtctt I8 60 
413 KLVKNASDKIELHLFMLNGI, 432 

I | | , , , 

18 61 ccagacaatgtctttgagttaactgaaatggaagtgctaagcctggagcttatcccagag 192 0 
433 PDNVFELTEMEVLSLELIPE 452 

1 I ( I i t 

1921 gtgaagctgccctctgcagtct.cacagctggtcaacctcaaggagcttcgtg'tgtaccat 198 0 
453 VKIiPSAVSQLVNLKELRVYH 472 

I |--~ | | I I 

1981 tcatctctggtcgtagaccatcctgcactggcctttctagaggagaatttaaaaatcctc 2 040 
473 SSIiVVDHPAIiAFLEENIiKIL 492 

I I I I I I 

2041 cgcctgaaatttactgaaabgggaaaaatcccacgctgggtatttcacctcaagaatctc 2100 
493 RLKFTEMGKIPRVIVFHIiKNIi 512 

-------I I i 1 1 1 

2101 aaggaactttatctttcgggct:gtgttct.ccctgaacagttgagtactatgcagttggag 2160 
513 KELYLSGCVLPEQLSTMQLE 532 

1 ! ( , -1 1 

2161 ggctttcaggacttaaaaaatctaaggaccctgtacttgaagagcagcctctcccggatc 2220 
■ 533 GFQD1iKN1»RTIjY I» K S SLSRI 552 

1 \ J ! , , 

2221 ccacaagttgttacagacctcctgccttcattgcagaaactgtcccttgataatgaggga 22 8 o 
553 PQVVTDliLPSLQKLSLDNEG 572 

I I I | I 1 

2281 agcaaactggttgtgttgaacaacttgaaaaagatggtcaatctgaaaagcctagaactg 2340 
573 SKIiVVLNNLKKMVNIiKSLEL 592 

I J [ 1 1 l 

2341 atcagctgtgacctggaacgcatcccacattccattttcagcctgaataatttgcatgag 240 0 
593 ISCDLERIPHS I FSLNNtiHE 612 

I I J I ( \ 

2401 ttagacctaagggaaaataaccttaaaactgtggaagagatcattagctttcagcatctt 2460 
613 LDLRENNLKTVEEI ISFQHL 632 

( „ „|— „w-.-| I I I 

2461 cagaatctttcctgcttaaagttgtggcacaataacafctgcttatattcctgcacagatt 2 520 
633 QNI1SCI1KL1WHNNIAYI PAQI 652 



FtG. IOC 
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1 1 j. , , -| 

2521 ggggcattatctaacctagagcagctctctttggaccataataatattgagaatctgccc 2 580 
653 GAliSNLiEQLfSLDHNNIENIaP 672 

| | , | , I 

2581 ttgcagcttttcctatgcactaaactacattatttggatctaagctataaccacttgacc 264 0 
673 LQLFLCTKLHYIiDLSYNHIjT 692 

_ J | | | , „| 

26 41 ttcattccagaagaaatccagtatctg-agtaa.tittgcagt.acttitgct.gtigaccaacaac 2 700 
693 FIPEEIQYLSNLQYFAVTNN 712 

| .| | | 1 j 

2701 aatattgagatgctaccagatgggctgttXcagtgcaaaaagctgcagtgtttactt-ttg 276 0 
713 NIEMIiPDGLFQCKKLQCLI/li 732 

| | t „. r , | J 

2761 gggaaaaatagcttgatgaatttgtcccctcatgtgggfcgagcfcgt.caaaccttactcat 2 820 
733 GKNSLMNLS PHVGELSNI.TH 752 

1 1 1 1 V 

2 821 ctggagctcattggtaattacctggaaacacttcctcctgaactagaaggatgtcagtcc 28 80 
753 LEXj XGNYXi'E T X* P P E XjE GCQ S 772 

| | | , | —| 

23 81 ctaaaacggaactgtctgattgttgaggagaacttgctcaatactcttcctctccctgta 294 0 
773 LKRNCIjIVE ENLLtfTLPLPV 792 

, 1 1 ( 1 1 

2941 acagaacgtttacagacgtgcttagacaaatgttgacTTAAAGAAAAGAGACCCGTGTTT 3 000 
793 TERXiQTCI.D!CC* 803 

1- I- —I — 1 1 1 

3 001. CAAAATCATTTTTAAAAGTATGCTCGGCCGGGCGTGGTGGCTCATGCCTATAATCCCAGC 3060 

| | t ! | 

3061 ACTrTGGGR.GGCCAAGATGGGCGGATTGCTTGAGGTCAGGAGTTCGAGACCAGTCTGGCC 3120 

, | | | | -- I 

3121 AACCTGGTGAAACCCCATCTCTGCTAAAACTACAAAAAAATTAGCCAGGCGTGGTGGCGT 3180 

, |-_-_ | — I 

3181 GCGCCTGTAATCCCAGCTACTTGGGAGGCTGACGCAGGGGAATTGCTTGAACCAGGGAGG 3240 

I I I I I I 

3241 TGGAGGTTGCAGTGAGCCGAGATTGTGCCACTGTACACCAGCCTGGGTGACAGAGCAAGA 3 300 

| \ [ __J | | 

3 301 CTCTTATCT CAAAAAAAAAAAAAAAT GCTCCAGGGCTTT AAATGAGAAGT AAAATTTTCT 3360 

1 1 1 — 1- — I -1 

3361 AAGTTAATAAAGATGAAGAATGGGTGACTATTATGATGAACCATAACTAAATGTCTTATT 3420 
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3 421 AAAGCAACTGA.GTGTCTAGCCCTAAATTAACCAGGTAAAAACTGTTAACACTAACCTGAA 34 8 0 

J , | | 

3481 GTTTTGTGAATAACTGTTCTrTAACTTATTGAGATGTTGCAAGAAATGCACATCCAGGGT 3 540 

, ( , 

3 5 4 X GG ACTGGG AGCT ATG AAATGACTAAATTCCTCCTTGCAGTGTTTACCT 3588 
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GENES AND PROTEINS ASSOCIATED WITH T 
CELL ACTIVATION 

[0001] This application claims benefit of U.S. Provisional 
Application No. 60/306,968, filed Jul. 20, 2001, which is 
hereby incorporated by reference in its entirety. 

1. FIELD OF THE INVENTION 

[0002] The present invention relates to novel T cell acti- 
vation-associated proteins (TCAPs), in particular to a G 
Protein-coupled Receptor (TA-GPCR), two GTPase -Acti- 
vating Proteins (TA-GAP), a serine/threonine class 2C phos- 
phatase (TA-PP2C); an NF-KB-like transcription factor (TA- 
NFKBH); a keich repeat-containing protein (TA-KRP); a 
transducin-like protein with a WD motif -containing domain 
(TA-WDRP); and a leucine repeat-rich protein (TA-LRRP); 
and derivatives thereof, the genes encoding them, and 
derivatives thereof. Production of proteins, derivatives, and 
antibodies is also provided. The invention further relates to 
therapeutic compositions and methods of diagnosis and 
therapy. 

2. BACKGROUND OF THE INVENTION 

2.1. GENE EXPRESSION IN T CELL 
ACTIVATION 

[0003] The study of gene expression changes has played a 
major role in development of the understanding of T lym- 
phocyte activation. During an immune response, T cells 
interact with antigen presenting cells (APCs) in a complex 
process involving intercellular interactions between many T 
cell surface receptors and cognate ligands on the APCs. 
During these encounters, T cells undergo an elaborate tran- 
scriptional response, leading to cellular differentiation and 
acquisition of immunologic function (Crabtree, Science 
243:355-61(1989)). Tcell activation also plays a central role 
in development of immunologic mechanisms of disease (W. 
Paul, ed., Fundamental Immunology, Third Edition, Raven 
Press, New York, 1993). An understanding of the molecular 
basis of T cell activation is therefore essential to both our 
understanding of immune responses and of how to manipu- 
late them therapeutically. Gene expression changes accom- 
panying T cell activation and differentiation have been the 
subject of numerous studies (Choi, et al, Cell. Immunol. 
168(l):78-84 (1996); Zipfel, et al., Mol. Cell. Biol. 
9(3): 1041-8 (1989); Zheng & Flavell, Cell 89(4):587-96 
(1997); Liu, et al., Genomics 39(2):171-84 (1997); Renner 
et al., J. Immunol. 159(3):1276-83 (1997); Ishaq, et al, J. 
Biol. Chem. 14:273(33):21210-16 (1998); Teague, et al., 
Proc. Natl. Acad. Sci. U.S.A. 96(22):12691-96 (1999); 
Hedrick, et al., Nature 308:149-53 (1984); Yanagi, et al., 
Nature 308:145-9 (1984); Brunet, Immunol. Rev. 103:21-36 
(1988)). 

[0004] Comparing patterns of gene expression is a widely 
used means of identifying novel genes, investigating gene 
function and finding potential new therapeutic targets (Shiue 
et al., Drug Devel. Res. 41:142-159 (1997)). The study of 
gene expression changes has played a major role in devel- 
opment of our understanding of T lymphocyte activation. 
With the completion of the human genome sequencing 
effort, it is now a realistic goal to document all gene 
expression changes that occur during T cell activation (Mar- 
rack, et al, Curr. Opin. Immunol. 12(2):206-9 (2000)), but 



it is more difficult to assess the relevance of these changes 
for immunologic function. Historically, many techniques 
have been used to identify and clone differentially expressed 
genes (Liang et al., Science 257:967-71 (1992); Welsh et al, 
Nucleic Acids Res. 20(19):4965-70 (1992); Tedder et al, 
Proc. Natl. Acad. Sci. U.S.A. 85(1):208-12 (1988); Davis et 
al, Proc. Natl. Acad. Sci. U.S.A. 81(7):2194-8 (1984); 
Lisitsyn et al, Science 259:946-51 (1993); Velculescu et al, 
Science 270:484-7 (1995); Diatchenko et al, Proc. Natl. 
Acad. Sci. U.S.A. 93(12): 6025-30 (1996); Jiang et al, Proc. 
Natl. Acad. Sci. U.S.A. 97(23): 12684-9 (2000); Yang et al, 
Nucleic Acids Res. 27(6):1517-23(1999)). However, these 
are generally not well suited for discerning the functional 
significance of gene expression differences. In many cases, 
these differences are not unique to a particular cellular 
pathway and the specificity of these changes becomes appar- 
ent only after secondary characterization using labor inten- 
sive techniques (Shiue et al., Drug Devel. Res. 41:142-159 
(1997)). 

[0005] Recently, the technique of DNAmicroarray hybrid- 
ization has been used to quantify the expression of many 
thousands of discrete sequences in a single assay known as 
expression profiling (Wang et al, Gene 229(1-2): 101-8 

(1999) ; Schena et al, Science 270:467-470 (1995); Lock- 
hart, et al, Nat. Biotechnol. 14:1675-1680 (1996); Lockhart 
et al, U.S. Pat. No. 6,040,138). Many applications have 
been described for expression profiling, but perhaps most 
relevant to elucidating gene function is the development of 
tools used to group genes according to similarities in pat- 
terns of gene expression in expression profiling experiments. 
Coexpression of genes of known function with poorly 
characterized or novel genes has been suggested as a method 
to assign function to genes for which information is not 
available (Eisen et al, Proc. Natl. Acad. Sci. U.S.A. 
95(25):14863-8 (1998)). Using a reference database or com- 
pendium of expression profiles from Saccharomyces cerevi- 
siae, novel open reading frames (ORFs) were used to show 
that coordinated transcriptional regulations were enriched 
for a given phenotype (Hughes et al, Cell 102:109-126 

(2000) ). In human cells, coregulation of uncharacterized 
expressed sequence tag (EST) sequences with known genes 
was noted, but no evaluation of the identities and properties 
of these ESTs was made. 

2.2. G-PROTEIN COUPLED RECEPTORS 

[0006] G-protein coupled receptors (GPCRs) form an 
extensive family of transmembrane regulatory proteins that 
elicit intracellular signals in nearly every physiological 
system of chordates and invertebrate organisms. These 
receptors are biologically important and malfunction of 
these receptors results in diseases such as Alzheimer's, 
Parkinson's, diabetes, dwarfism, color blindness, retinitis 
pigmentosa and asthma. GPCRs are also important signaling 
molecules in subjects with depression, schizophrenia, sleep- 
lessness, hypertension, anxiety, stress, renal failure and in 
several other cardiovascular, metabolic, neural, oncology 
and immune disorders (Horn and Vriend, J. Mol. Med. 
76:464-468 (1998)). They have also been shown to play a 
role in HIV infection (Feng et al., Science 272:872-877 
(1996)). 

[0007] GPCRs are integral membrane proteins character- 
ized by the presence of seven hydrophobic transmembrane 
domains which span the plasma membrane and form a 
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bundle of antiparallel alpha helices. The transmembrane 
domains account for structural and functional features of the 
receptor. In most cases, the bundle of helices forms a binding 
pocket; however, when the binding site must accommodate 
more bulky molecules, the extracellular N-terminal segment 
or one or more of the three extracellular loops participate in 
binding and in subsequent induction of conformational 
change in intracellular portions of the receptor. The activated 
receptor, in turn, interacts with an intracellular G-protein 
complex, composed of a heterotrimer of ol, p and y subunits, 
the a subunit having a bound guanosine diphosphate (GDP). 
Upon interaction of the G protein with the ligand-bound 
receptor, the G protein substitutes GTP for the GDP, causing 
a simultaneous release of the a subunit from the p and y 
subunits, and the release of all three subunits from the 
receptor. The now-activated ot subunit in turn mediates 
further intracellular signaling activities, generally through 
interaction with guanine nucleotide binding (G) proteins and 
the production of second messengers such as cyclic AMP 
(cAMP), phospholipase C, inositol triphosphate or ion chan- 
nel proteins (Baldwin, J. M. Curr. Opin. Cell Biol. 6:180- 
190 (1994)). The activity of the receptors are modulated by 
modification, such as phosphorylation, or by binding to a 
regulatory molecule, such as by the negative regulatory 
molecule arrestin, or by internalization wherein the receptor 
is degraded in a lysosome (see generally Hu, L. A., et al., J. 
Biol. Chem. 275:38659-38666 (2000)). 

[0008] The amino -terminus of the GPCR is extracellular, 
of variable length and often glycosylated, while the carboxy- 
terminus is cytoplasmic. Extracellular loops of the GPCR 
alternate with intracellular loops and link the transmembrane 
domains. The most conserved domains of GPCRs are the 
transmembrane domains and the first two cytoplasmic loops. 
GPCRs range in size from under 400 to over 1000 amino 
acids (Coughlin, S. R., Curr. Opin. Cell Biol. 6:191-197 
(1994)). 

[0009] GPCRs can be divided into five broad structural 
classes, A-E, based on amino acid sequence similarity and 
sequence motifs. The largest class is class A, which can, in 
turn, be divided into subgroups according to receptor 
sequence similarity and ligand characteristics. The catego- 
rization of these relationships is illustrated by the following 
examples: 

[0010] Class A (rhodopsin-like) GPCRs include: bio- 
genic amine receptors (e.g., a-adrenergic, p-adren- 
ergic, dopamine, histamine, muscarinic acetylcho- 
line, melatonin, 5-HT, octop amine and tyramine); 
peptidic ligand receptors (e.g., angiotensin, bomb- 
esin, chemokine, endothelin, galanin, hormone pro- 
tein, F-met-leu-phe, melanocortin, N-formyl pep- 
tide, neuropeptide Y, neurokinin, opiate, tachykinin, 
vasopressin, oxytocin and somatostatin); rhodopsin 
receptors (e.g., vertebrate rhodopsin, arthropod 
rhodopsin, and olfactory receptors); prostanoid 
receptors (e.g., prostaglandin, prostacyclin, and 
thromboxane); nucleotide receptors (e.g., adenosine 
and purinoceptors); hormone-releasing GPCRs (e.g., 
gonadotropin-releasing hormone, thyrotrop in-releas- 
ing hormone, growth hormone, and secretagogue 
GPCRs); 

[0011] Class B (secretin-like) GPCRs include calci- 
tonin, calcitonin releasing factor, calcitonin gene- 



related peptide, gastrin, cholecystokinin, glucagon, 
growth hormone-releasing hormone, parathyroid 
hormone, vasoactive intestinal peptide, PACAP, 
diuretic hormone and secretin GPCRs; 

[0012] Class C (metabotropic glutamate-like) 
GPCRs include metabotropic glutamate, metabotro- 
pic GABA B , and extracellular calcium-sensing 
GPCRs; 

[0013] Class D includes pheromone GPCRs; and 

[0014] Class E includes cAMP-binding GPCRs. 

[0015] GPCRs respond to a diverse array of ligands 
including lipid analogs, amino acids and their derivatives, 
peptides, cytokines, and specialized stimuli such as light, 
taste, and odor. GPCRs function in physiological processes 
including vision (the rhodopsins), smell (the olfactory recep- 
tors), neurotransmission (muscarinic acetylcholine, dopam- 
ine, and adrenergic receptors), and hormonal response 
(luteinizing hormone and thyroid-stimulating hormone 
receptors). 

[0016] In addition, GPCR mutations, both of the loss-of- 
function and of the activating variety, have been associated 
with numerous human diseases (Coughlin, supra). For 
instance, retinitis pigmentosa may arise from either loss-of- 
function or activating mutations in the rhodopsin gene. 
Somatic activating mutations in the thyrotropin receptor 
cause hyperfunctioning thyroid adenomas (Parma, J. et al. 
Nature 365:649-651 (1993)). Parma et al. suggest that 
certain G-protein-coupled receptors susceptible to constitu- 
tive activation may behave as proto-oncogenes. 

2.3. RHO-GTPASE ACTIVATING PROTEINS 

[0017] GAPs (GTPase activating proteins) greatly 
increase the rate of GTP hydrolysis by Got proteins and are 
thus responsible for terminating G protein activation by 
returning Fa to the GDP-bound state (Kehrl et al., Immunity 
8:1-10 (1998); Berman et al., J. Biol. Chem. 273:1269-1272 
(1998)). GDP dissociation inhibitors (GDIs) inhibit GDP 
dissociation and are responsible for keeping the G protein in 
an inactive state in resting cells (Takai et al., Int. Rev. Cytol. 
133:187-230 (1991); Bokoch et al., FASEB J. 7:750-759 
(1993)). GDP dissociation stimulators (GDSs) stimulate the 
exchange of GDP for GTP and thereby promote Ga activa- 
tion (Takai et al., Int. Rev. Cytol. 133:187-230 (1991); 
Bokoch et al., FASEB J. 7:750-759 (1993)). 

[0018] A superfamily of GTPases known as Ras proteins 
has been found to be critical in the regulation of normal and 
transformed cell growth, and control much of the informa- 
tion flow within the cell. Rho proteins are members of the 
Ras superfamily of GTPases, and are involved in the orga- 
nization of the cytoskeleton. Rho activity is regulated by the 
opposing actions of GTPase -activating proteins (GAPs) and 
guanine nucleotide exchange factors (GEFs), with GAPs 
stimulating the slow intrinsic rate of GTP hydrolysis on Ras 
and GEFs stimulating the basal rate of exchange of GDP for 
GTP on Ras. Thus, GAPs act as negative regulators of Ras 
function (Boguski & McCormick, Nature 366:643-654 
(1993)). 

[0019] GAPs can be specific to distinct physiological 
processes, but can also affect several processes through 
GTPase pathway crosstalk. At least one mammalian Rho- 
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GAP has been characterized that contains a region related to 
the C terminal domain of Ber, a RhoGEF. Whereas some 
GAPs are specific for one kind of Rho, one GAP, pi 90, is a 
"promiscuous" GAP for all Rho proteins. Adding to the 
crosstalk due to some cross-specificity of particular GAPs, 
certain GAPs may interact with each other to mediate 
physiological changes. For example, pl20-GAP binds pl90- 
GAP, linking Ras with Rho proteins to cause changes in the 
cytoskeleton (Boguski & McCormick, Nature 366:643-654 
(1993)). 

2.4. SERINE/THREONINE CLASS 2C 
PHOSPHATASES 

[0020] The class 2C serine/threonine protein phosphatases 
(PP2Cs), as the name suggests, remove phosphate groups 
from the serine and/or threonine residues of a wide variety 
of proteins. The dephosphorylation of phosphothreonine 
appears to be approximately 20-fold more efficient than 
dephosphorylation of phosphoserines, and it has been specu- 
lated that PP2C substrates are phosphorylated at threonine 
residues. The protein phosphatases have been separated into 
seven groups based on their biochemical properties (Herzig 
and Neumann, Physiol. Rev. 80(1): 173-210 (2000)). PP2C is 
a monomeric protein of approximately 382 residues. Class 
2C STPs exist in two isoforms, designated oc and p; alter- 
native splicing appears to generate the latter. Alternative 
splicing appears to further segregate the a and p isoforms 
into sub-isoforms (Deana et al., Biochim. Biophys. Acta 
1051:199-202 (1990)). 

[0021] PP2Cs have been implicated in a number of impor- 
tant biochemical pathways. In particular, it is implicated in 
the negative regulation of the MAP (mitogen activated 
protein) kinase signaling cascade. For example, PP2Ca2 is 
able to suppress the activation of p38 and JNK (Jun-N- 
terminal kinase) MAP kinases induced by environmental 
stress, wound stress and the cytokine TNF-a. (Takekawa et 
al., EMBO J. 17:4744-4752 (1998)). Because serine/threo- 
nine phosphatases are involved in such important responses, 
they are attractive target of, and candidates for, small- 
molecule inhibition and pharmacological intervention (see 
e.g., Lazo et al. U.S. Pat. No. 6,040,323). 

2.5. NF-kB-LIKE TRANSCRIPTION FACTORS 

[0022] NF-kB proteins are transcription factors. In their 
inactive form, they are complexed with the IkBo. protein in 
the cytoplasm. However, upon cell activation, they disasso- 
ciate from IkBcl, translocate to the nucleus and bind kB 
motifs in the promoters of many genes, in particular of the 
promoters of genes whose expression is involved the 
immune response. NF-kB has been implicated as a tran- 
scriptional activator in a variety of disease and inflammatory 
states and is thought to regulate cytokine levels including 
but not limited to TNF-a and also to be an activator of HIV 
transcription (Dbaibo, et al., J Biol. Chem. 17762-66 (1993); 
Duh et al., Proc. Natl. Acad. Sci. U.S.A. 86, 5974-78 (1989); 
Bachelerie et al., Nature 350:709-12 (1991); Suzuki et al., 
Biochem. Biophys. Res. Comm. 193:277-83 (1993)). In 
particular, the inappropriate regulation of NF-kB and its 
dependent genes has been associated with septic shock, 
graft -versus-host disease, acute inflammatory conditions, 
acute phase response, transplant rejection, autoimmune dis- 
eases, and cancer (Manna & Aggarwal, J. Immunol. 
165:2095-2102 (1999)). 



2.6. KELCH-LIKE PROTEINS 

[0023] Members of the kelch-repeat superfamily of pro- 
teins all contain one or more copies of a domain known as 
a p propeller (see Adams et al., Trends Cell Biol. 10: 17-24 
(2000)). The P propeller consists of 4-12 repeats of the kelch 
motif, each repeat constituting a "blade" of the propeller. 
Most members of the five categories of kelch repeats within 
the kelch superfamily have propellers having six kelch 
repeats (see Adams, supra, providing representative kelch 
motif sequences for each of the five categories). Kelch 
superfamily proteins engage in a wide variety of physiologi- 
cal functions, such as actin-binding, control of cell morphol- 
ogy and organization, and control of gene expression. Most 
keich proteins have protein binding partners, and in a 
number of proteins, it has been established that the p 
propeller facilitates the interaction (Adams, supra). Bio- 
chemical and mutational analyses provide evidence that the 
keich proteins as a class engage in multiprotein complexes 
through contact sites in their p propeller domains. 

[0024] Kelch proteins regulating gene expression include 
the protein Keapl, which sequesters Nrf2 (NF-E2-related 
factor 2) transcription factor in the cytoplasm. Another kelch 
protein, RAG-2 (recombination activating gene 2) combines 
with RAG-1 to facilitate V(D)J recombination in immuno- 
globulin and T cell receptor genes. It is the N-terminal 355 
amino acid residues of RAG-2, which form the p propeller, 
that interact with RAG-1 to cause recombination. Persons 
with mutations in the p propeller of RAG-2 suffer deficient 
RAG-1 DNA binding, and consequent severe combined 
immunity deficiency. The proteins currently characterized 
represent only a small part of a putatively extensive and 
growing superfamily (Adams, supra). 

2.7. TRANSDUCINS AND WD DOMAINS 

[0025] Transducin is a G protein essential for the exquis- 
itely tightly-regulated transmission of visual information in 
the rods of the eye. Upon stimulation by a photon, rhodop- 
sin, a prototypical GPCR, changes conformation to its active 
form, metarhodopsin, and is able to interact with transducin, 
a G protein consisting of three subunits, Ga Gp and Gy. Ga 
of transducin, like the a subunit of other G proteins, contains 
a bound GDP in its inactive state. Metarhodopsin binds 
transducin, causing the release of GDP and the binding of 
transducin to metarhodopsin. Subsequent Got binding of 
GTP causes the release of Ga both from metarhodopsin and 
from Gpy. Ga-GTP then activates its effector, cyclic GMP 
phosphodiesterase. The subsequent drop in the local con- 
centration of cGMP causes closure of cGMP-gated channels 
in the photoreceptor plasma membrane (Natochin et al., J. 
Biol. Chem. 274(12):7865-7869 (1999); Marin et al., J. Biol. 
Chem. 275(3):1930-1936 (2000)). 

[0026] Cessation of the photoresponse requires hydrolysis 
of the GTP on G^-GTP. 

[0027] However, the native GTPase activity of transducin 
is far too slow. The activity of transducin therefore, is tightly 
regulated by the protein RGS9-Gp5L, which greatly 
increases the rate of GTP hydrolysis. As the transmission of 
visual signals is on a subsecond timescale, the activation of 
transducin by metarhodopsin, and the subsequent quenching 
of the signal by RGS9-Gp5L occurs within a fraction of a 
second (Skiba et al., J. Biol. Chem. 275(42):32716-32720 
(2000)). 
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[0028] While transducin is a functional part of the visual 
system, one transducin -like protein, transducin -like 
enhancer of split (TLE), has been shown to act as part of a 
transcriptional complex in liver- specific expression. TLE 
interacts with the CRII domain of the liver-specific pleio- 
tropic transcription factor HNF3p to repress HNF3p -medi- 
ated transcription (Wang et al., J. Biol. Chem. 
275(24): 18418-18423 (2000)). 

[0029] TLP also contains a domain defined by WD 
repeats. WD repeats are similar to the ketch repeats 
described above, in that the WD repeats together form the 
blades of a "propeller." Conserved sequence motifs differ- 
entiate the WD repeat motif from that of the kelch motif. The 
best-characterized WD-repeat protein is the Gp subunit of 
heterotrimeric g proteins, which forms a tight heterodimer 
with the y subunit. The function of the WD repeat domain, 
in general, has been to facilitate the reversible interaction 
between the protein containing it and one or several other 
proteins (Smith et al., TIBS 24(5):181-5 (1999)). 

[0030] Citation or discussion of references herein above 
shall not be construed as an admission that such references 
are prior art to the present invention. 

2.8 LEUCINE-RICH REPEAT PROTEINS 

[0031] Leucine-rich repeats (LRRs) are relatively short 
motifs (22-28 residues in length) found in a variety of 
cytoplasmic, membrane and extracellular proteins associ- 
ated with widely different functions. LRRs appear to facili- 
tate protein -protein interaction. In vitro studies of a synthetic 
LRR from Drosophila Toll protein have indicated that the 
peptides form gels by adopting beta-sheet structures that 
form extended filaments. These results are consistent with 
the idea that LRRs mediate protein-protein interactions and 
cellular adhesion (Gay et al., FEES Lett. 291(1):87-91 
(1991)). Other functions of LRR-containing proteins include 
binding to enzymes (Tan et al. J Biol Chem. 265(1): 13-9 
(1990) and vascular repair (Hickey et al., Proc. Natl. Acad. 
Sci. U.S.A. 86(17):6773-7 (1989). The 3-D structure of 
ribonuclease inhibitor, a protein containing 15 LRRs, 
reveals LRRs to be a new class of alphaA^eta fold (Kobe et 
al., Nature 366(6457): 75 1-6 (1993). 

3. SUMMARY OF THE INVENTION 

[0032] We have evaluated the use of coexpression over 
many reference conditions as a method for gene discovery 
and functional characterization of unknown expressed 
sequence tags (ESTs) coregulated over many conditions 
with T cell cytokines, which are well known markers for T 
cell activation. Transcripts associated with these ESTs have 
been identified that have been found to encode novel 
polypeptides with desirable properties for targets for immu- 
nosuppressive drugs, including a G protein-coupled recep- 
tor, two GTPase- activating proteins, a serine/threonine class 
2C phosphatase, a keich motif-containing protein, two vari- 
ants of an NF-KB-like transcription factor, a transducin- 
related protein with a WD motif-containing domain, and a 
leucine-rich repeat protein. 

[0033] The present invention provides genes and proteins 
associated with T cell activation. Specifically, the invention 
relates to the T cell activation-associated proteins TA-GAP 
(a GTPase activating protein), TA-GPCR (a G protein- 
coupled receptor), TA-PP2C (a serine/threonine class 2C 



phosphatase), TA-NFKBH (an NF-kB like transcription 
factor), TA-KRP (a kelch repeat-containing protein), TA- 
WDRP (transducin-like protein), and TA-LRRP (a leucine 
repeat-rich protein), their amino acid sequences and the 
sequences of the genes and associated nucleic acids encod- 
ing them. These proteins are referred to herein as TCAPs (T 
Cell Activation- associated Proteins). Nucleic acids hybrid - 
izable to or complementary to the foregoing nucleotide 
sequences are also provided. 

[0034] The invention also relates to a method of producing 
the proteins of the present invention, and of using these 
proteins as markers for T cell activation by antibody recog- 
nition. The invention also relates to probes for hybridization 
analysis, and primers for PCR analysis, of markers of T cell 
activation. TCAPs are up regulated during T cell activation; 
thus, the invention further relates to methods of regulating 
the immune response by modifying the activity of these 
proteins or the genes that encode them. 

[0035] The invention also relates to nucleic acids contain- 
ing full-length open reading frames encoding TCAPs, iden- 
tified by the method of the invention. 

[0036] The invention also relates to TCAP derivatives that 
are functionally active, i.e., they are capable of displaying 
one or more known functional activities associated with a 
full-length (wild-type) TCAP. Such functional activities 
include but are not limited to GTPase activation activity 
(TA-GAP), GTPase activity (TA-GPCR), G-coupled protein 
receptor activity (TA-GPCR), DNA binding activity (TA- 
NFKBH), protein binding activity (TA-WDRP, 
TA-NFKBH, TA-KRP, TA-LRRP), antigenicity (i.e., the 
ability to bind or compete with a TCAP for binding) to an 
anti-TCAP antibody, immunogenicity (ability to generate 
antibody which binds to a TCAP), and ability to bind, or to 
compete with TCAPs for binding, to a receptor/ligand for a 
particular TCAP. The invention further relates to derivatives 
(including but not limited to fragments) of TCAPs that 
comprise one or more domains of a TCAP. 

[0037] Antibodies to TCAPs, or to their derivatives, are 
additionally provided. Because these antibodies detect spe- 
cific proteins correlated with T cell activation, they detect 
specific markers of T cell activation. 

[0038] The present invention further provides methods of 
production of the TCAPs and derivatives thereof, e.g., by 
recombinant means. 

[0039] The present invention also relates to therapeutic 
and diagnostic methods and compositions based on TCAPs 
and associated nucleic acids. Therapeutic compounds of the 
invention include but are not limited to TCAPs and TCAP 
derivatives, including fragments thereof; antibodies thereto; 
nucleic acids encoding the TCAPs or derivatives thereof; 
and antisense nucleic acids to the genes encoding these two 
proteins. Diagnostic methods include but are not limited to 
the detection of diseases or disorders involving T cell 
activation or a lack thereof by measuring the expression of 
one or more TCAPs or TCAP nucleic acids, where increased 
expression of the TCAP(s) or TCAP nucleic acid(s), relative 
to a standard or control or subject not having the disorder, 
indicates the presence of a disease or disorder involving 
inappropriate or undesired T cell activation, and decreased 
expression, relative to a standard or control or subject not 
having the disorder indicates the presence of a disease or 
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disorder involving a deficit in desired T cell activation. 
Diagnostic methods further include monitoring of the pro- 
duction, or suppression of production, of TCAPs by use of 
nucleic acids that hybridize to TCAP nucleic acids, and/or 
monitoring the production, or suppression of production, of 
TCAPs by use of antibodies that recognize at least one 
TCAP. 

[0040] The invention provides for treatment or prevention 
of immune disorders involving inappropriate or undesirable 
T cell activation by administering compounds that antago- 
nize TCAP activities (e.g., antibodies, antisense nucleic 
acids). The invention also provides methods of treatment or 
prevention of immune disorders involving failure of T cell 
activation, or by activation of T cells where such activation 
is desired, by administering compounds that promote TCAP 
activity, e.g., TA-GAP, TA-GPCR, TA-WDRP, TA-NFKBH, 
TA-PP2C, TA-KRP or TA-LRRP function (e.g., TA-GAP, 
TA-GPCR, TA-WDRP, TA-NFKBH, TA-PP2C, TA-KRP or 
TA-LRRP, an agonist of any of these TCAPs; nucleic acids 
that encode any of these TCAPs). In a specific embodiment, 
TCAP function is antagonized in order to suppress the 
activation of T cells, and thereby modify the immune 
response, in vivo or in vitro. 

[0041] Animal models, diagnostic methods and screening 
methods for predisposition to disorders, and methods to 
identify TCAP agonists and antagonists, are also provided 
by the invention. 

[0042] A novel T cell activation-associated protein from 
an activated Jurkat T cell line, TA-GAP has been identified. 
The cDNA sequence containing the full-length open reading 
frame encoding TA-GAP was identified through use of an 
EST (AI253155) that was co -regulated over many condi- 
tions with T cell cytokines. The nucleotide sequence of the 
cDNA containing the TA-GAP coding region has similarity 
to human BAC clone RP1-111C20 from chromosome 
6q25.3-27, which clone contains part of a novel gene 
described as similar to that encoding Chlamydomonas radial 
spoke protein 3. The amino acid sequence of TA-GAP shows 
homology to the human KIAA1391 protein (GenBank Acc. 
No. BAA92629.1), whose function is not known, and to a 
human SH3 domain-binding protein that includes a 
RhoGAP (GTPase- activator protein for Rho-like GTPases). 
The invention thus provides the polynucleotide sequence of 
the cDNA for the two splice variants encoding TA-GAP 
(FIGS. 1, 2, SEQ ID NOS: 1, 2) and vectors and host cells 
comprising TA-GAP for use in immunosuppressive drug 
development. The invention also provides the amino acid 
sequence of two TA-GAP variants (FIGS. 1, 2, SEQ ID 
NOS: 3, 4), a method of recombinantly producing TA-GAP 
for use as a target, and a method for producing antibodies 
directed against TA-GAP. 

[0043] Also identified is T Cell Activation-associated Pro- 
tein TA-GPCR. TA-GPCR was identified by analysis of a 
transcript corresponding to the EST AA040696, which was 
co-regulated with cytokine transcripts. Through PCR of 
actual transcripts, two cDNAs containing full-length open 
reading frames were identified that encode the same protein, 
TA-GPCR. TA-GPCR shows homology to a putative 

chemokine receptor (GenBank Acc. No. NP 006009.1) and 

a putative seven transmembrane spanning receptor of the 
rhodopsin family (GanBank Acc. No. CAC17790). The 
invention thus provides the nucleotide sequence of the two 



cDNAs encoding full-length TA-GPCR (FIGS. 3A-3D, 
4A-4C; SEQ ID NOS: 5, 6) and vectors and host cells 
comprising a TA-GPCR-encoding nucleic acid sequence for 
use in immunosuppressive drug development. The invention 
also provides the amino acid sequence of TA-GPCR (FIGS. 
3A-3D, 4A-4C; SEQ ID NO: 7). 

[0044] Also identified in the same manner are: (1) 
TA-PP2C, predicted to be a serine/threonine class 2C phos- 
phatase; (2) 

(3) TA-KRP a protein 
containing a POZ/BTB domain and three kelch repeats; (4) 
TA-WDRP, a transducin-like protein containing 11 WD 
repeats; and (5) TA-LRRP, a protein containing four trans- 
membrane-domains and 12 leucine-rich repeats. The inven- 
tion thus provides the nucleotide sequence of cDNAs encod- 
ing the above full-length proteins (FIGS. 5-10; SEQ ID 
NOS: 8, 10, 12, 14, 16, 18, respectively) and vectors and 
host cells comprising a TA-PP2C-, TA-NFKIBH-, 
TA-KRP-, TA-WDRP-, or TA-LRRP-encoding nucleic acid 
sequence for use in immunosuppressive drug development. 
The invention also provides the amino acid sequence of 
TA-PP2C, TA-NFKBH, TA-KRP, TA-WDRP, and 
TA-LRRP (FIGS. 5-10; SEQ ID NO: 9, 11, 13, 15, 17, 19, 
respectively). These proteins, and the related genes, have not 
been previously identified. 

3.1. DEFINITIONS 

[0045] As used herein, underscoring or italicizing the 
name of a gene shall indicate the gene, in contrast to its 
encoded protein product which is indicated by the name of 
the gene in the absence of any underscoring or italicizing. 
For example, "TA-GPCR" shall mean the gene encoding the 
protein product "TA-GPCR." 

4. DESCRIPTION OF THE FIGURES 

[0046] FIGS. 1A-1E show a 3218 nucleotide cDNA 
sequence (SEQ ID NO: 1) encoding TA-GAP and the 
predicted 731 amino acid-long sequence of TA-GAP (SEQ 
ID NO: 2). 

[0047] FIGS. 2A-2D show a 3051 nucleotide cDNA 
sequence (SEQ ID NO: 3) encoding a splice variant of 
TA-GAP and the predicted 553 amino acid-long sequence of 
a variant of TA-GAP (SEQ ID NO: 4). 

[0048] FIGS. 3A-3E show a 3612 nucleotide cDNA 
sequence (SEQ ID NO: 5) encoding TA-GPCR and the 
predicted 346 amino acid-long sequence of TA-GPCR (SEQ 
ID NO: 7). 

[0049] FIGS. 4A-4D show a 2345 nucleotide cDNA 
sequence (SEQ ID NO: 6) encoding TA-GPCR and the 
predicted 346 amino acid-long sequence of TA-GPCR (SEQ 
ID NO: 7). 

[0050] FIGS. 5A-5G show a 3748 nucleotide cDNA 
sequence (SEQ ID NO: 8) encoding TA-PP2C and the 
predicted 304 amino acid-long sequence of TA-PP2C (SEQ 
ID NO: 9). 

[0051] FIGS. 6A-6C show an 1736 nucleotide cDNA 
sequence (SEQ ID NO: 10) encoding a long form of 
TA-NFKBH and the predicted 465 amino acid-long 
sequence of the long form of 
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[0052] FIGS. 7A-7D show an 1834 nucleotide cDNA 
sequence (SEQ ID NO: 12) encoding a short form of 
TA-NFKBH and the predicted 313 amino acid-long 
sequence of tht 

[0053] FIGS. 8A-8F show a 3049 nucleotide cDNA 
sequence (SEQ ID NO: 14) encoding TA-WDRP and the 
predicted 951 amino acid-long TA-WDRP (SEQ ID NO: 
15). 

[0054] FIGS. 9A-9H show a 4617 nucleotide cDNA 
sequence (SEQ ID NO: 16) encoding TA-KRP and the 
predicted 575 amino acid-long TA-KRP (SEQ ID NO: 17). 

[0055] FIGS. 10A-10E show a 3588 nucleotide cDNA 
sequence (SEQ ID NO: 18) encoding TA-LRRP and the 
predicted 803 amino acid-long TA-LRRP (SEQ ID NO: 19). 

[0056] FIG. 11 diagrams the relative sizes of TA-GPCR, 
TA-GAP (long and short forms), TA-LRRP, TA-WDRP, 
TA-KRP, TA-NFKBH (long and short forms), and 
TA-PP2C. Specific domains or sequence motifs present in 
each are indicated as gray boxes. 

[0057] FIG. 12 shows co -clustering of known cytokines 
and unknown ESTs in expression profiling experiments. 
FlexJet™ arrays representing either 25,000 or 50,000 Uni- 
gene clusters were hybridized to a mixture of cRNAs from 
untreated versus treated cells of various types. The experi- 
ments contained comparisons of activated and unactivated 
Jurkat cells; K562 cells; peripheral blood T cells; THP1 
cells; NB4 cells; JCAM cells; HL60 cells; and B-lympho- 
blast cells. A total of 3853 genes regulated >3-fold, P<0.01 
in a total of 104 experiments were analyzed by a two 
dimensional hierarchical clustering algorithm. Genes were 
grouped by greatest similarity of regulation over all experi- 
ments (Y axis) and the experiments showing the greatest 
similarities in gene regulation (X axis). Only a section of the 
total data set is shown (64 genes and 94 experiments). 
Experiments involving activated peripheral blood T cells 
and activated Jurkat T cells are indicated with horizontal 
black bars. Genes up regulated in a particular experiment are 
colored medium gray; genes down regulated in that experi- 
ment are colored light gray; and genes showing no regula- 
tion in a particular experiment are colored black. The set of 
genes shown here demonstrates enrichment for T cell cytok- 
ines. Known cytokine genes are highlighted on the right 
hand Y axis with light gray circles. This region also contains 
21 ESTs of unknown function, indicated with dark gray 
circles. 

[0058] FIG. 13 shows linkage of two Unigene clusters by 
genomic tiling. 

[0059] FIG. 13 A depicts the mapping of the consensus 
sequences from two previously unlinked Unigene clusters, 
Hs. 7581 and Hs. 130864 to a portion of human chromosome 
6. 

[0060] FIG. 13B depicts a portion of an array containing 
oligonucleotides from the genomic sequence surrounding 
two Unigene EST clusters, Hs. 7581 and Hs. 130864, on 
chromosome 6. Nested oligonucleotides (60 bp) were 
selected from every tenth nucleotide position of both strands 
of non-repetitive sequence in alternating fashion. The array 
was hybridized with a mixture of cRNA from activated 
(labeled with red fluorescent dye) and unactivated (labeled 



with green fluorescent dye) Jurkat cells. The red-fluorescing 
dots (shown as dark gray) represent oligonucleotides show- 
ing greater hybridization to a transcript expressed at higher 
levels in activated cells, whereas yellow spots (shown as 
white) show equal hybridization with both samples. The 
white circles show indicate the boundaries of a contiguous 
segment of genomic DNA hybridizing with a transcript 
present at higher levels in activated cells. The top circle 
maps near the 5' end of Hs. 130864 and the bottom circle, 
near the 3' end of Hs. 7581. The contiguous hybridization 
suggests that this region hybridizes with a single transcript. 

[0061] FIG. 13C depicts a graph showing XDEV mea- 
surements of hybridization over the region of chromosome 
6 adjacent to Unigene clusters, Hs. 7581 and Hs. 130864. 
For a description of the calculation of XDEV, see Example 
3, infra. The region between the white circles from part B 
corresponds to the peak of XDEV measurements. 

[0062] FIG. 13D depicts linking by tiling data of Unigene 
clusters, Hs. 7581 and Hs. 130864. The previously known 
boundaries of these clusters are shaded dark gray; the region 
between these (shown in white) was predicted to hybridize 
with the same transcript by hybridization data in FIGS. 6B 
and 6C. The linkage of these EST clusters was confirmed by 
RT-PCR analysis. Further extension of these EST clusters by 
RT-PCR analysis revealed that this genomic region repre- 
sents an exon from the 3' untranslated region of the human 
homolog of the transcription factor, Bach2. 

[0063] FIG. 14 shows the upregulation of TA-GAP during 
T cell activation. Transcripts from activated Jurkat T cells 
showing significant regulation (>2-fold change and 
P<0.0001 in most samples) over the unactivated condition 
are depicted as thin light gray lines. R/G ratio is above 0.0 
when a particular gene is upregulated. The TA-GAP tran- 
script is depicted by the thick black line (indicated by the 
arrow); transcripts for 18 other GAP domain-containing 
proteins are depicted by thin black lines (KIAA1501, 
KIAA0660, A1479025, ABR, GIT1, GIT2, ARHGAP1, 
ARHGAP4, G38P, GAPCENA, GAPL, IQGAP1, IQGAP2, 
NGAP, RAB3GAP, RANGAP1, RAP1GA1, RASA1). Of 
the transcripts tested that encode GAP-domain containing 
proteins, TA-GAP is the only one to show significant 
upregulation during T cell activation. 

[0064] FIG. 15 shows upregulation of TA-GPCR during T 
cell activation. Transcripts from activated Jurkat cells show- 
ing significant regulation (>2-fold change and P<0.0001 in 
most samples) are depicted as thin light gray lines. Tran- 
scripts encoding GPR proteins are depicted as black lines. 
The R/G ratio is above 0.0 when a particular gene is 
upregulated. The TA-GPCR transcript is depicted by the 
thick black line, and transcripts for 27 other GPR proteins 
are depicted by thin black lines (GPR39, GPR51, AI61367, 
AI208357, GPRK6, GPRK5, GPR51, GPR19, AI659657, 
GPR48, EBI2, GPRK5, GPRK6, GPR68, GPR4, GPR9, 
LANCL1, CCR1, CCR4, CCR5, CCR7, CCR8, CMKLR1, 
CXCR4, HM74, LTBR4, AA040696). Of the transcripts 
tested that encode GPRs, TA-GPCR was the only one to 
show significant upregulation. 

5. DETAILED DESCRIPTION OF THE 
INVENTION 

[0065] The present invention relates to the amino acid 
sequences of the T Cell Activation associated proteins 
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TA-GAP, TA-GPCR, TA-PP2C, TA-NFKBH, TA-KRP, TA- 
WDRP and TA-LRRP (referred to hereinafter individually 
and as a group as "TCAPs"), and to nucleotide sequences of 
the genes encoding these proteins. SEQ ID NO: 1 is a cDNA 
sequence containing a full open reading frame that encodes 
TA-GAP (SEQ ID NO: 2). SEQ ID NO: 3 is a cDNA 
sequence containing a full open reading frame that encodes 
a splice variant of TA-GAP (SEQ ID NO: 4). TA-GAP has 
high sequence similarities to known GTPase -activating pro- 
teins, and likely possesses this function and is involved in 
the modulation of signal transduction. SEQ ID NO: 5 and 6 
are distinct cDNAs, both of which contain full open reading 
frames that encode the same TA-GPCR (SEQ ID NO: 7). 
TA-GPCR shows high sequence similarity to known G 
protein coupled receptors, and likely plays a significant role 
in signal transduction. SEQ ID NO: 8 is a cDNA sequence 
containing a full open reading frame that encodes TA-PP2C 
(SEQ ID NO: 9). TA-PP2C shows high sequence similarity 
to known serine-threonine proteases, has a PP2C box, and 
likely functions to modulate signal transduction. SEQ ID 



TA-NFKBH has sequence similarity to known transcription 
factors, contains five Ankyrin repeats, and may play a part 
in gene regulation during T cell activation. SEQ ID NO: 14 
is a cDNA sequence containing a full open reading frame 
that encodes TA-WDRP (SEQ ID NO: 15). TA-WDRP is a 
transducin-like protein with eleven WD repeats; based on its 
structural similarities with transducin, TLP is likely a G 
protein. SEQ ID NO: 16 is a cDNA sequence containing a 
full open reading frame that encodes TA-KRP (SEQ ID NO: 
17). TA-KRP has three kelch repeat motifs and a POZ/BTB 
domain, and may be involved in G-protein signaling. SEQ 
ID NO: 18 is a cDNA sequence containing a full open 
reading frame that encodes TA-LLRP (SEQ ID NO: 19). 
TA-LLRP is a leucine -repeat rich protein. Diagrams of each 
of these proteins, showing their relative sizes and the posi- 
tions of each of the domains noted above, are provided in 
FIG. 11. 

[0066] The invention further relates to fragments and other 
derivatives of the above TCAPs. Nucleic acids encoding 
such fragments or derivatives are also within the scope of the 
invention. The invention provides TCAP-encoding genes 
("TCAP genes") and their encoded proteins of many differ- 
ent species. As used herein, "TCAP genes" includes cDNAs 
or other nucleic acids encoding a TCAP in whole or in part. 
The TCAP genes of the invention include human and related 
genes (homologs) in other species. In specific embodiments, 
the TCAP genes and proteins are from vertebrates, or more 
particularly, mammals. In a preferred embodiment of the 
invention, the TCAP genes and proteins are of human origin. 
Production of the foregoing proteins and derivatives, e.g., by 
recombinant methods, is provided. 

[0067] The invention also relates to TCAP derivatives of 
the invention that are functionally active, i.e., they are 
capable of displaying one or more known functional activi- 
ties associated with a full-length (wild-type) TCAPs. Such 
functional activities include but are not limited to activation 
of GTPases (TA-GAP), indirect activation of membrane- 
bound enzymes or ion channels (TA-GPCR); transcriptional 
activation (KBTF); phosphatase activity (TA-PP2C); 
GTPase activity and the ability to interact with GPCRs 
(TA-TCP); antigenicity (i.e., the ability to bind, or compete 



with a TCAP for binding, to an anti-TCAP antibody; immu- 
nogenicity (ability to generate an antibody which binds to a 
TCAP); ability to bind, or compete with TCAP for binding, 
to an TCAP-domain-containing protein or other ligand. 

[0068] The invention further relates to fragments, and 
derivatives thereof, of TCAPs that comprise one or more 
domains of the TCAPs. 

[0069] Antibodies to TCAPs, their derivatives, are addi- 
tionally provided. 

[0070] The present invention also relates to therapeutic 
and diagnostic methods and compositions based on TCAPs, 
TCAP nucleic acids and anti-TCAP antibodies. The inven- 
tion provides for immunosuppression by administering com- 
pounds that inhibit or antagonize TCAP activity (e.g., 
antagonists of a TCAP; antisense molecules directed to the 
genes encoding a TCAP; antibodies to a TCAP). 

[0071] Animal models, diagnostic methods and screening 
methods for predisposition to disorders are also provided by 
the invention. 

[0072] The invention is illustrated by way of examples 
infra which disclose, inter alia, the cloning and character- 
ization of the TCAPs (Section 6). 

[0073] For clarity of disclosure, and not by way of limi- 
tation, the detailed description of the invention is divided 
into the subsections which follow. 

5.1. ISOLATION OF THE TCAP GENES 

[0074] The invention relates to the nucleotide sequences 
of nucleic acids. In a specific embodiment, the inventor 
relates to nucleic acids that encode a TCAP. In a more 
specific embodiment, the invention relates to nucleic acids 
that encode TA-GAP, TA-GPCR, TA-PP2C, TA-NFKBH, 
TA-KRP, TA-WDRP, or TA-LRRP. In further specific 
embodiments, TA-GAP nucleic acids comprise the cDNA 
sequences of SEQ ID NO: 1 or SEQ ID NO: 2, or the coding 
regions thereof, or nucleic acid encoding TA-GAP (e.g., a 
protein having the sequence of SEQ ID NO: 3 or SEQ ID 
NO: 4). In another specific embodiment, TA-GPCR nucleic 
acids comprise the cDNA sequences of SEQ ID NO: 5 or 
SEQ ID NO: 6, or the coding regions thereof, or nucleic acid 
encoding TA-GPCR, (e.g., a protein having the sequence of 
SEQ ID NO: 7). In another specific embodiment, TA-PP2C 
nucleic acids comprise the cDNA sequence of SEQ ID NO: 
8, or the coding regions thereof, or nucleic acid encoding 
TA-PP2C (e.g., a protein having the sequence of SEQ ID 
NO: 9). In another specific embodiment, TA-NFKBH 
nucleic acids comprise the cDNA sequences of SEQ ID NO: 
10 or SEQ ID NO: 12, or the coding regions thereof, or 
nucleic acid encoding TA-GPCR, (e.g., a protein having the 
sequence of SEQ ID NO: 11 or 13). In another specific 
embodiment, TA-WDRP nucleic acids comprise the cDNA 
sequence of SEQ ID NO: 14, or the coding regions thereof, 
or nucleic acid encoding TA-WDRP, (e.g., a protein having 
the sequence of SEQ ID NO: 15). In another specific 
embodiment, TA-KRP nucleic acids comprise the cDNA 
sequences of SEQ ID NO: 16, or the coding regions thereof, 
or nucleic acid encoding TA-KRP, (e.g., a protein having the 
sequence of SEQ ID NO: 17). In another specific embodi- 
ment, TA-LRRP nucleic acids comprise the cDNA sequence 
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of SEQ ID NO: 18, or the coding regions thereof, or nucleic 
acid encoding TA-LRRP, (e.g., a protein having the 
sequence of SEQ ID NO: 19). 

[0075] The invention provides purified nucleic acids con- 
sisting of at least 10 nucleotides (i.e., a hybridizable portion) 
of a nucleotide sequence encoding a TCAP; in other embodi- 
ments, the nucleic acids consist of at least 10, 20, 50, 100, 
150, or 200 contiguous nucleotides of a nucleotide sequence 
encoding a TCAP, or a full-length coding sequence. In 
another embodiment, the nucleic acids are smaller than 35, 
200 or 500 nucleotides in length. Nucleic acids can be single 
or double stranded. In another embodiment, the nucleic 
acids comprise a sequence of at least 10 nucleotides that 
encode a fragment of a TCAP, wherein the fragment of the 
TCAP displays one or more functional activities of the 
TCAP, or contains a functional domain or motif of the 
TCAP. In no event, however, does the invention provide for 
a contiguous nucleic acid sequence present in the GenBank 
search results provided in the Examples in Section 6. 

[0076] The invention also relates to nucleic acids hybrid- 
izable to or complementary to the foregoing sequences. In 
specific aspects, nucleic acids are provided which comprise 
a sequence complementary to at least 10, 25, 50, 100, or 200 
nucleotides or the entire coding region of a gene encoding a 
TCAP, or the reverse complement (antisense) of any of these 
sequences. In a specific embodiment, a nucleic acid which is 
hybridizable to a TA-GAP nucleic acid (e.g., having part or 
the whole of sequence SEQ ID NO: 1 or SEQ ID NO: 2, or 
the complement thereof), or to a nucleic acid encoding a 
TA-GAP derivative, under conditions of low stringency is 
provided. In another specific embodiment, a nucleic acid 
which is hybridizable to a TA-GPCR nucleic acid (e.g., 
having part or the whole of SEQ ID NO: 5 or SEQ ID NO: 
6, or the complement thereof), or to a nucleic acid encoding 
a TA-GPCR derivative, under conditions of low stringency 
is provided. In further specific embodiment, a nucleic acid 
which is hybridizable to a TA-PP2C nucleic acid (e.g., 
having part or the whole of SEQ ID NO: 8, or the comple- 
ment thereof), or to a nucleic acid encoding a TA-PP2C 
derivative, under conditions of low stringency is provided. 
In a further specific embodiment, a nucleic acid which is 
hybridizable to a TA-NFKBH nucleic acid (e.g., having part 
or the whole of SEQ ID NO: 10 or 12, or the complement 
thereof), or to a nucleic acid encoding a TA-NFKBH deriva- 
tive, under conditions of low stringency is provided. In 
another specific embodiment, a nucleic acid which is hybrid- 
izable to a TA-WDRP nucleic acid (e.g., having part or the 
whole of SEQ ID NO: 14, or the complement thereof), or to 
a nucleic acid encoding a TA-WDRP derivative, under 
conditions of low stringency is provided. In yet a further 
specific embodiment, a nucleic acid which is hybridizable to 
a TA-KRP nucleic acid (e.g., having part or the whole of 
SEQ ID NO: 16, or the complement thereof), or to a nucleic 
acid encoding a TA-KRP derivative, under conditions of low 
stringency is provided. In yet a further specific embodiment, 
a nucleic acid which is hybridizable to a TA-LRRP nucleic 
acid (e.g., having part or the whole of SEQ ID NO: 18, or 
the complement thereof), or to a nucleic acid encoding a 
TA-LRRP derivative, under conditions of low stringency is 
provided. 

[0077] By way of example and not limitation, procedures 
using such conditions of low stringency are as follows (see 
also Shilo and Weinberg, Proc. Natl. Acad. Sci. U.S.A. 



78:6789-6792 (1981)): Filters containing DNA are pre- 
treated for 6 h at 40° C. in a solution containing 35% 
formamide, 5xSSC, 50 mM Tris-HCl (pH 7.5), 5 mM 
EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 /ig/ml 
denatured salmon sperm DNA. Hybridizations are carried 
out in the same solution with the following modifications: 
0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 jug g/ml salmon 
sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20xl0 6 
cpm 32 P-labeled probe is used. Filters are incubated in 
hybridization mixture for 18-20 h at 40° C, and then washed 
for 1.5 h at 55° C. in a solution containing 2xSSC, 25 mM 
Tris-HCl (pH 7.4), 5 mM EDTA, and 0.1% SDS. The wash 
solution is replaced with fresh solution and incubated an 
additional 1.5 h at 60° C. Filters are blotted dry and exposed 
for autoradiography. If necessary, filters are washed for a 
third time at 65-68° C. and reexposed to film. Other condi- 
tions of low stringency which may be used are well known 
in the art (e.g., as employed for cross-species hybridiza- 
tions). 

[0078] In another specific embodiment, a nucleic acid 
hybridizable to a nucleic acid encoding a TCAP, or its 
inverse complement, under conditions of high stringency is 
provided. By way of example and not limitation, procedures 
using such conditions of high stringency are as follows. 
Prehybridization of filters containing DNA is carried out for 
8 h to overnight at 65° C. in buffer composed of 6xSSC, 50 
mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 //g/ml denatured salmon sperm 
DNA. Filters are hybridized for 48 h at 65° C. in prehy- 
bridization mixture containing 100 jug/ml denatured salmon 
sperm DNA and 5-20xl0 6 cpm of 32 P-labeled probe. Wash- 
ing of filters is done at 37° C. for 1 h in a solution containing 
2xSSC, 0.01% PVP, 0.01% Ficoll, and 0.01% BSA. This is 
followed by a wash in O.lxSSC at 50° C. for 45 min before 
autoradiography. Other conditions of high stringency that 
may be used are well known in the art. 

[0079] In another specific embodiment, a nucleic acid that 
is hybridizable to a nucleic acid encoding a TCAP under 
conditions of high stringency is provided (see, e.g., Section 
5.1). 

[0080] Nucleic acids hybridizable to the complement of 
the above-mentioned sequences are also provided. 

[0081] The above-mentioned nucleic acids preferably also 
encode a protein displaying one or more functional activities 
of a TCAP or a domain or motif thereof. 

[0082] Nucleic acids encoding derivatives of TCAPs (see 
Sections 5.6 and 5.6.1), and antisense nucleic acids to genes 
encoding TCAPs (see Section 5.7.3.1.1) are additionally 
provided. As is readily apparent, as used herein, a nucleic 
acid encoding a "fragment" or "portion" of a TCAP shall be 
construed as referring to a nucleic acid encoding only the 
recited fragment or portion of the specific TCAP and not the 
other contiguous portions of the specific TCAP protein as a 
continuous sequence. 

[0083] Fragments of nucleic acids encoding the TCAPs 
described above, which comprise regions conserved 
between (i.e., having homology or identity to) other TCAP- 
encoding nucleic acids of the same or different species, are 
also provided. Nucleic acids encoding one or more domains 
of a specific TCAP are provided. 

[0084] Fragments or derivatives of TCAP nucleic acids 
that hybridize specifically to one or more TCAP nucleic 
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2010 


nn 


.sub 


2010- 


nn.sub.690-nn.sub.700; nn.sub.700-nn.sub.710; nn.sub. 710- 


nn 


sub 


2020 


nn 


sub 


2020- 


-nn 


sub 


2030 


nn 


.sub 


2030- 


nn.sub.720; nn.sub.720-nn.sub.730; nn.sub. 730-nn.sub. 740; 


nn 


sub 


2040 


nn 


sub 


2040- 


-nn 


sub 


2050 


nn 


.sub 


2050- 


nn.sub.740-nn.sub.750; nn.sub.750 -nn.sub.760; 


nn 


sub 


2060 


nn 


sub 


2060- 


-nn 


sub 


2070 


nn 


.sub 


2070- 


nn.sub.760-nn.sub.770; nn.sub.770-nn.sub.780; nn.sub. 780- 


nn 


sub 


2080 


nn 


sub 


2080- 


-nn 


sub 


2090 


nn 


.sub 


2090- 


nn.sub.790; nn.sub.790-nn.sub.800; nn.sub.800-nn.sub.810; 


nn 


sub 


2100 


nn 


.sub 


2100 


-nn 


.sub 


.2110 


nn 


.sub 


.2110- 


nn.sub.810-nn.sub.820; nn.sub.820 -nn.sub.830; 


nn 


sub 


2120 


nn 


sub 


2120- 


-nn 


sub 


2130 


nn 


.sub 


2130- 


nn.sub.830-nn.sub.840; nn.sub.840-nn.sub.850; nn.sub.850- 


nn 


sub 


2140 


nn 


sub 


2140- 


-nn 


sub 


2150 


nn 


.sub 


2150- 


nn.sub.860; nn.sub.860-nn.sub.870; nn.sub.870-nn.sub.880; 


nn 


sub 


2160 


nn 


sub 


2160- 


-nn 


sub 


2170 


nn 


.sub 


2170- 


nn.sub.880-nn.sub.890; nn.sub.890 -nn.sub.900; 


nn 


sub 


2180 


nn 


sub 


2180- 


-nn 


sub 


2190 


nn 


.sub 


2190- 


nn.sub.900-nn.sub.910; nn.sub.910-nn.sub.920; nn.sub.920- 


nn 


sub 


2200 


nn 


sub 


2200- 


-nn 


sub 


2210 


nn 


.sub 


2210- 


nn.sub.930; nn.sub.930-nn.sub.940; nn.sub.940-nn.sub.950; 


nn 


sub 


2220 


nn 


sub 


2220- 


-nn 


sub 


2230 


nn 


.sub 


2230- 


nn.sub.950-nn.sub.960; nn. sub. 9 60 -nn. sub. 970; 


nn 


sub 


224U 


nn 


sub 


ZZ40- 


-nn 


sub 


ZZdO 


nn 


.sub 


ZZdu- 


nn.sub.970-nn.sub.980; nn.sub.980-nn.sub.990; nn.sub.990- 


nn 


sub 


2260 


nn 


sub 


2260- 


-nn 


sub 


2270 


nn 


.sub 


2270- 


nn.sub.1000 


nn.sub.1000-nn.sub.1010; nn.sub. 1010- 


nn 


sub 


2280 


nn 


sub 


2280- 


-nn 


sub 


2290 


nn 


.sub 


2290- 


nn.sub.1020 


nn.sub.1020-nn.sub.1030; nn.sub. 1030- 


nn 


sub 


2300 


nn 


sub 


2300- 


-nn 


sub 


2310 


nn 


.sub 


2310- 


nn.sub.1040 


nn.sub.1040-nn.sub.1050; nn.sub. 1050- 


nn 


sub 


2320 


nn 


sub 


2320- 


-nn 


sub 


2330 


nn 


.sub 


2330- 


nn.sub.1060 


nn.sub.1060-nn.sub.1070; nn.sub. 1070- 


nn 


sub 


2340 


nn 


sub 


2340- 


-nn 


sub 


2350 


nn 


.sub 


2350- 


nn.sub.1080 


nn.sub.1080-nn.sub.1090; nn.sub. 1090- 


nn 


sub 


2360 


nn 


sub 


2360- 


-nn 


sub 


2370 


nn 


.sub 


2370- 


nn.sub.1100 


nn.sub.1100-nn.sub.1110; nn.sub. 1110- 


nn 


sub 


2380 


nn 


sub 


2380- 


-nn 


sub 


2390 


nn 


.sub 


2390- 


nn. sub. 1120 


nn.sub.1120-nn.sub.1130; nn.sub. 1130- 


nn 


sub 


2400 


nn 


sub 


2400- 


-nn 


sub 


2410 


nn 


.sub 


2410- 


nn. sub. 1140 


nn.sub . 1 140-nn.sub . 1 150; nn.sub . 1 150- 


nn 


sub 


2420 


nn 


sub 


2420 


-nn 


sub 


2430 


nn 


.sub 


2430- 


nn. sub. 11 60 


nn.sub.1160-nn.sub.1170; nn.sub. 1170- 


nn 


sub 


2440 


nn 


sub 


2440- 


-nn 


sub 


2450 


nn 


.sub 


2450- 
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nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 
nn.sub. 



2460 
2480 
2500 
2520 
2540 
2560 
2580 
2600 
2620 
2640 
2660 
2680 
2700 
2720 
2740 
2760 
2780 
2800 
2820 
2840 
2860 
2880 
2900 
2920 
2940 
2960 
2980 
3000 
3020 
3040 
3060 
3080 
3100 
3120 
3140 
3160 
3180 
3200 
3218 



nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 



2460- 
2480- 
2500- 
2520- 
2540- 
2560- 
2580- 
2600- 
2620- 
2640- 
2660- 
2680- 
2700- 
2720- 
2740- 
2760- 
2780- 
2800- 
2820- 
2840- 
2860- 
2880- 
2900- 
2920- 
2940- 
2960- 
2980- 
3000- 
3020- 
3040- 
3060- 
3080- 
.3100 
3120- 
3140- 
3160- 
3180- 
3200- 



nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
■nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 



2470 
.2490 
2510 
2530 
2550 
2570 
2590 
.2610 
2630 
.2650 
2670 : 
.2690 
2710 : 
2730 
2750 
.2770 : 
2790 
.2810 
2830 : 
.2850 : 
.2870 : 
2890 : 
2910 
.2930 
2950 
.2970 
.2990 
.3010 
.3030 
.3050 
3070 
3090 
.3110 
.3130 
3150 
3170 
.3190 
3210 



nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 
nn.sub 



2470- 
.2490- 
2510- 
2530- 
2550- 
2570- 
2590- 
.2610- 
2630- 
2650- 
2670- 
2690- 
.2710- 
2730- 
2750- 
.2770- 
2790- 
.2810- 
2830- 
.2850- 
.2870- 
2890- 
2910- 
.2930- 
.2950- 
.2970- 
.2990- 
3010- 
3030- 
3050- 
3070- 
3090- 
.3110- 
.3130- 
3150- 
3170- 
.3190- 
3208- 



[0086] Sequences suitable for hybridization to SEQ ID 
NO: 3, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 8, SEQ 
ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 
16 or SEQ ID NO: 18 or their complements may be obtained 
in similar fashion. 

[0087] The invention also provides nucleic acids compris- 
ing nucleotide sequences of at least 30, 50, 60, 70, 90, 95 or 
99% homologous to a nucleotide sequence of a TCAP gene 
or a portion thereof. "Homologous" means that in various 
embodiments, the aligned first nucleotide sequence has 
preferably at least 30% or 50%, more preferably 60% or 
70%, even more preferably at least 80% or 90%, and even 
more preferably at least 95% identity to a second nucleotide 
sequence over a nucleotide sequence length equal to the 
shorter of the two sequences, plus any introduced gaps. 
When the alignment is done by a computer homology 
program known in the art, such as BLAST (blastn), the 
percent homology is calculated by dividing the number of 
nucleotides in the TCAP-encoding nucleic acid sequence or 
fragment thereof exactly matching the nucleotide at the 
same position in the aligned sequence by the length of the 
alignment in nucleotides, including introduced gaps, where 
introduced gaps count as mismatches. 



[0088] Specific embodiments for the cloning of a gene 
encoding a TCAP, presented as a particular example but not 
by way of limitation, follows: 

[0089] For expression cloning (a technique commonly 
known in the art), an expression library is constructed by 
methods known in the art. For example, mRNA (e.g., 
human) is isolated, cDNA is made and ligated into an 
expression vector (e.g., a bacteriophage derivative) such that 
it is capable of being expressed by the host cell into which 
it is then introduced. Various screening assays can then be 
used to select for the expressed TCAP product. In one 
embodiment, anti-TA-GAP antibodies can be used for selec- 
tion. In another embodiment, anti-TA-GPCR antibodies can 
be used for selection. In another embodiment, anti-TA- 
NFKBH antibodies can be used for selection. In yet another 
embodiment, anti-TA-KRP antibodies can be used for selec- 
tion. In yet a further embodiment, anti-TA -PP2C antibodies 
can be used for selection. In another embodiment, anti-TA- 
LRRP antibodies can be used for selection. In yet another 
embodiment, anti-TA-LRRP antibodies can be used for 
selection. 

[0090] In another embodiment of the invention, poly- 
merase chain reaction (PCR) is used to amplify the desired 
sequence in a genomic or cDNA library, prior to selection. 
Oligonucleotide primers representing known TCAP-encod- 
ing sequences can be used as primers in PCR. In a preferred 
aspect, the oligonucleotide primers represent at least part of 
the conserved segments of strong homology between TCAP- 
encoding genes of different species, for example transmem- 
brane domains, WD repeat domains, kelch motifs, (3 pro- 
pellers, Ank-repeat domains, leucine-rich regions and 
ligand-binding domains. The synthetic oligonucleotides 
may be utilized as primers to amplify by PCR sequences 
from RNA or DNA, preferably a cDNA library, of potential 
interest. Alternatively, one can synthesize degenerate prim- 
ers for use in the PCR reactions. 

[0091] In PCR according to the invention, the nucleic acid 
being amplified can include RNA or DNA, for example, 
mRNA, cDNA or genomic DNA from any eukaryotic spe- 
cies. PCR can be carried out, e.g., by use of a Perkin-Elmer 
Cetus thermal cycler and Taq polymerase. It is also possible 
to vary the stringency of hybridization conditions used in 
priming the PCR reactions, to allow for greater or lesser 
degrees of nucleotide sequence similarity between a known 
TCAP nucleotide sequence and a nucleic acid homolog 
being isolated. For cross-species hybridization, low strin- 
gency conditions are preferred. For same-species hybridiza- 
tion, moderately stringent conditions are preferred. After 
successful amplification of a segment of a TCAP gene 
homolog, that segment may be cloned, sequenced, and 
utilized as a probe to isolate a complete cDNA or genomic 
clone. This, in turn, will permit the determination of the 
gene's complete nucleotide sequence, the analysis of its 
expression, and the production of its protein product for 
functional analysis, as described infra. In this fashion, addi- 
tional genes encoding TCAPs and TCAP may be identified. 

[0092] The above recited methods are not meant to limit 
the following general description of methods by which 
clones of genes encoding TCAPs may be obtained. 

[0093] Any eukaryotic cell potentially can serve as the 
nucleic acid source for the molecular cloning of a TCAP- 
encoding gene. The nucleic acid sequences encoding TCAPs 
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can be isolated from vertebrate, mammalian, human, por- 
cine, bovine, feline, avian, equine, canine, as well as addi- 
tional primate sources. The DNA may be obtained by 
standard procedures known in the art from cloned DNA 
(e.g., a DNA "library"), by chemical synthesis, by cDNA 
cloning, or by the cloning of genomic DNA, or fragments 
thereof, purified from the desired cell. (See, for example, 
Sambrook et al., Molecular Cloning, A Laboratory Manual, 
2d. ed., Cold Spring Harbor Laboratory Press, Cold Spring 
Harbor, New York (1989); Glover, D. M. (ed.), DNA Clon- 
ing: A Practical Approach, MRL Press, Ltd., Oxford, U.K. 
Vol. I, II (1985)). Clones derived from genomic DNA may 
contain regulatory and intron DNA regions in addition to 
coding regions; clones derived from cDNA will contain only 
exon sequences. Whatever the source, the gene should be 
cloned into a suitable vector for propagation of the gene. 

[0094] In the cloning of the gene from genomic DNA, 
DNA fragments are generated, some of which will encode 
the desired gene. The DNA may be cleaved at specific sites 
using various restriction enzymes. Alternatively, one may 
use DNase in the presence of manganese to fragment the 
DNA, or the DNA can be physically sheared, as for example, 
by sonication. The linear DNA fragments can then be 
separated according to size by standard techniques, includ- 
ing but not limited to, agarose and polyacrylamide gel 
electrophoresis and column chromatography. 

[0095] Once the DNA fragments are generated, identifi- 
cation of the specific DNA fragment containing the desired 
gene may be accomplished in a number of ways. For 
example, if an TCAP gene (of any species) or its specific 
RNA, or a derivative thereof (see Section 5.6) is available 
and can be purified and labeled, the generated DNA frag- 
ments may be screened by nucleic acid hybridization to the 
labeled probe (Benton and Davis, Science 196:180 (1977); 
Grunstein And Hogness, Proc. Natl. Acad. Sci. U.S.A. 
72:3961 (1975). Those DNA fragments with substantial 
homology to the probe will hybridize. It is also possible to 
identify the appropriate fragment by restriction enzyme 
digestion(s) and comparison of fragment sizes with those 
expected according to a known restriction map if such is 
available. Further selection can be carried out on the basis of 
the properties of the gene. 

[0096] Alternatively, the presence of the gene may be 
detected by assays based on the physical, chemical, or 
immunological properties of its expressed product. For 
example, cDNA clones, or DNA clones that hybrid-select 
the proper mRNAs, can be selected that produce a protein 
having e.g., similar or identical electrophoretic migration, 
isoelectric focusing behavior, proteolytic digestion maps, 
kinase activity, inhibition of cell proliferation activity, sub- 
strate binding activity, or antigenic properties as known for 
a specific TCAP. If an antibody to a particular TCAP is 
available, that TCAP may be identified by binding of labeled 
antibody to the clone(s) putatively producing the TCAP in 
an ELISA (enzyme -linked immunosorbent assay)-type pro- 
cedure. 

[0097] A TCAP gene can also be identified by mRNA 
selection by nucleic acid hybridization followed by in vitro 
translation. In this procedure, fragments are used to isolate 
complementary mRNAs by hybridization. Such DNA frag- 
ments may represent available, purified DNA of another 
species containing a gene encoding a TCAP. Immunopre- 



cipitation analysis or functional assays (e.g., aggregation 
ability in vitro; binding to receptor; see infra) of the in vitro 
translation products of the isolated products of the isolated 
mRNAs identifies the mRNA and, therefore, the comple- 
mentary DNA fragments that contain the desired sequences. 
In addition, specific mRNAs may be selected by adsorption 
of polysomes isolated from cells to immobilized antibodies 
specifically directed against a specific TCAP. A radiola- 
beled TCAP cDNA can be synthesized using the selected 
mRNA (from the adsorbed polysomes) as a template. The 
radiolabeled mRNA or cDNA may then be used as a probe 
to identify the TCAP DNA fragments from among other 
genomic DNA fragments. 

[0098] Alternatives to isolating the TCAP genomic DNA 
include, but are not limited to, chemically synthesizing the 
gene sequence itself from a known sequence or making 
cDNA to the mRNA which encodes a TCAP. For example, 
RNA for cDNA cloning of TA-GPCR, TA-GAP, 
TA-NFKBH, TA-KRP, TA-PP2C, TA-WDRP or TA-LBRP 
can be isolated from cells that express TA-GPCR, TA-GAP, 
TA-NFKBH, TA-KRP, TA-PP2C, TA-WDRP or TA-LRRP. 
Other methods are possible and within the scope of the 
invention. 

[0099] The identified and isolated gene can then be 
inserted into an appropriate cloning vector. A large number 
of vector-host systems known in the art may be used. 
Possible vectors include, but are not limited to, plasmids or 
modified viruses, but the vector system must be compatible 
with the host cell used. Such vectors include, but are not 
limited to, bacteriophages such as lambda derivatives, or 
plasmids such as pBR322 or pUC plasmid derivatives or the 
pBluescript vector (Stratagene). The insertion into a cloning 
vector can, for example, be accomplished by ligating the 
DNA fragment into a cloning vector which has complemen- 
tary cohesive termini. However, if the complementary 
restriction sites used to fragment the DNA are not present in 
the cloning vector, the ends of the DNA molecules may be 
enzymatically modified. Alternatively, any site desired may 
be produced by ligating nucleotide sequences (linkers) onto 
the DNA termini; these ligated linkers may comprise spe- 
cific chemically synthesized oligonucleotides encoding 
restriction endonuclease recognition sequences. In an alter- 
native method, the cleaved vector and TCAP-encoding gene 
may be modified by homopolymeric tailing. Recombinant 
molecules can be introduced into host cells via transforma- 
tion, transfection, infection, electroporation, etc., so that 
many copies of the gene sequence are generated. 

[0100] In an alternative method, the desired gene may be 
identified and isolated after insertion into a suitable cloning 
vector in a "shotgun" approach. Enrichment for the desired 
gene, for example, by size fractionization, can be done 
before insertion into the cloning vector. 

[0101] In specific embodiments, transformation of host 
cells with recombinant DNA molecules that incorporate the 
isolated TCAP-encoding gene, cDNA, or synthesized DNA 
sequence enables generation of multiple copies of the gene. 
Thus, the gene may be obtained in large quantities by 
growing transformants, isolating the recombinant DNA mol- 
ecules from the transformants and, when necessary, retriev- 
ing the inserted gene from the isolated recombinant DNA. 

[0102] It will be understood that the RNA sequence 
equivalent of the nucleotide sequences provided herein can 
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be easily and routinely generated by the substitution of 
thymine (T) residues with uracil (U) residues. 

[0103] The TCAP sequences provided by the instant 
invention include those nucleotide sequences encoding sub- 
stantially the same amino acid sequences as found in native 
TCAP proteins, and those encoded amino acid sequences 
with functionally equivalent amino acids, as well as those 
encoding other TCAP derivatives, as described in Sections 
5.6 and 5.6.1 infra for derivatives of the TCAPs described 
herein. 

5.2. EXPRESSION OF GENES ENCODING 
TCAPS 

[0104] The nucleotide sequence coding for a TCAP or a 
functionally active fragment or other derivative thereof (see 
Section 5.6), can be inserted into an appropriate expression 
vector, i.e., a vector which contains the necessary elements 
for the transcription and translation of the inserted protein- 
coding sequence. The necessary transcriptional and transla- 
tional signals can also be supplied by the native TCAP gene 
and/or its flanking regions. A variety of host -vector systems 
may be utilized to express the protein-coding sequence. 
These include but are not limited to mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); 
insect cell systems infected with virus (e.g., baculo virus); 
microorganisms such as yeast containing yeast vectors, or 
bacteria transformed with bacteriophage, DNA, plasmid 
DNA, or cosmid DNA. The expression elements of vectors 
vary in their strengths and specificities. Depending on the 
host-vector system utilized, any one of a number of suitable 
transcription and translation elements may be used. In 
specific embodiments, the human TA-GPCR, TA-GAP, TA- 
NFKBH, TA-KRP, TA-PP2C, TA-WDRP or TA-LRRP gene 
is expressed, or a sequence encoding a functionally active 
portion of human TCAP encoded by one of these genes is 
expressed. In yet another embodiment, a fragment of a 
TCAP comprising a domain of the particular TCAP is 
expressed. 

[0105] Any of the methods previously described for the 
insertion of DNA fragments into a vector may be used to 
construct expression vectors containing a chimeric gene 
consisting of appropriate transcriptional/translational con- 
trol signals and the protein coding sequences. These meth- 
ods may include in vitro recombinant DNA and synthetic 
techniques and in viva recombinants (genetic recombina- 
tion). Expression of nucleic acid sequence encoding a TCAP 
or peptide fragment thereof may be regulated by a second 
nucleic acid sequence so that the TCAP or peptide fragment 
thereof is expressed in a host transformed with the recom- 
binant DNA molecule. For example, expression of a TCAP 
may be controlled by any promoter/enhancer element known 
in the art. In a specific embodiment, the promoter is heter- 
ologous to (i.e., not a native promoter of) the specific 
TCAP-encoding gene. Promoters that may be used to con- 
trol expression of TCAP-encoding genes include, but are not 
limited to, the SV40 early promoter region (Bemoist and 
Chambon, Nature 290:304-310 (1981)), the promoter con- 
tained in the 3' long terminal repeat of Rous sarcoma virus 
(Yamamoto et al., Cell 22:787-797 (1980)), the herpes 
thymidine kinase promoter (Wagner et al., Proc. Natl. Acad. 
Sci. U.S.A. 78:1441-1445 (1981)), the regulatory sequences 
of the metallothionein gene (Brinster et al., Nature 296:39- 
42 (1982)); prokaryotic expression vectors such as the 



p-lactamase promoter (Villa-Kamaroff et al., Proc. Natl. 
Acad. Sci. U.S.A. 75:3727-3731 (1978)), or the tat promoter 
(DeBoer et al., Proc. Natl. Acad. Sci. U.S.A. 80:21-25 
(1983)); see also "Useful proteins from recombinant bacte- 
ria" in Scientific American, 242:74-94 (1980); plant expres- 
sion vectors comprising the nopaline synthetase promoter 
region (Herrera-Estrella et al., Nature 303:209-213 (1983)) 
or the cauliflower mosaic virus 35S RNA promoter (Gardner 
et al., Nucl. Acids Res. 9:2871 (1981)), and the promoter of 
the photosynthetic enzyme ribulose biphosphate carboxy- 
lase (Herrera-Estrella et al., Nature 310:115-120 (1984)); 
promoter elements from yeast or other fungi such as the 
Gal4 promoter, the ADC (alcohol dehydrogenase) promoter, 
PGK (phosphoglycerol kinase) promoter, alkaline phos- 
phatase promoter, and the following animal transcriptional 
control regions, which exhibit tissue specificity and have 
been utilized in transgenic animals: elastase I gene control 
region active in pancreatic acinar cells (Swift et al., Cell 
38:639-646 (1984); Ornitz et al., Cold Spring Harbor Symp. 
Quant. Biol. 50:399-409 (1986); MacDonald, Hepatology 
7:425-515 (1987)); insulin gene control region active in 
pancreatic beta cells (Hanahan, Nature 315: 115-122 (1985)), 
immunoglobulin gene control region active in lymphoid 
cells (Grosschedl et al., Cell 38:647-658 (1984); Adames et 
al., Nature 318:533-538 (1985); Alexander et al., Mol. Cell. 
Biol. 7:1436-1444 (1987)), mouse mammary tumor virus 
control region active in testicular, breast, lymphoid and mast 
cells (Leder et al., Cell 45:485-495 (1986)), albumin gene 
control region active in liver (Pinkert et al., Genes and 
Devel. 1:268-276 (1987)), alp ha -fetoprotein gene control 
region active in liver (Krumlauf et al., Mol. Cell. Biol. 
5:1639-1648 (1985); Hammer et al., Science 235:53-58 
(1987); alpha 1 -antitrypsin gene control region active in the 
liver (Kelsey et al., Genes and Devel. 1:161-171 (1987)), 
beta-globin gene control region active in myeloid cells 
(Mogram et al., Nature 315:338-340 (1985); Kollias et al., 
Cell 46:89-94 (1986); myelin basic protein gene control 
region active in oligodendrocyte cells in the brain (Readhead 
et al., Cell 48:703-712 (1987)); myosin light chain-2 gene 
control region active in skeletal muscle (Sani, Nature 
314:283-286 (1985)), and gonadotropic releasing hormone 
gene control region active in the hypothalamus (Mason et 
al., Science 234:1372-1378 (1986)). 

[0106] In a specific embodiment, a vector is used that 
comprises a promoter operably linked to a TCAP-encoding 
nucleic acid, one or more origins of replication, and, option- 
ally, one or more selectable markers (e.g., an antibiotic 
resistance gene). 

[0107] In a specific embodiment, an expression construct 
is made by subcloning the coding sequence from a TCAP 
gene into the EcoRI restriction site of each of the three 
pGEX vectors (Glutathione S-Transferase expression vec- 
tors; Smith and Johnson, Gene 7:31-40 (1988)). This allows 
for the expression of the TCAP product from the subclone in 
the correct reading frame. 

[0108] Expression vectors containing TCAP-encoding 
gene inserts can be identified by three general approaches: 
(a) nucleic acid hybridization, (b) presence or absence of 
"marker" gene functions, and (c) expression of inserted 
sequences. In the first approach, the presence of a TCAP- 
encoding gene inserted in an expression vector can be 
detected by nucleic acid hybridization using probes com- 
prising sequences that are homologous to an inserted TCAP- 
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encoding gene. In the second approach, the recombinant 
vector/host system can be identified and selected based upon 
the presence or absence of certain "marker" gene functions 
(e.g., thymidine kinase activity, resistance to antibiotics, 
transformation phenotype, occlusion body formation in 
baculovirus, etc.) caused by the insertion of a TCAP gene in 
the vector. For example, if the TCAP-encoding gene is 
inserted within the marker gene sequence of the vector, 
recombinants containing the insert can be identified by the 
absence of the marker gene function. In the third approach, 
recombinant expression vectors can be identified by assay- 
ing the specific TCAP product expressed by the recombi- 
nant. Such assays can be based, for example, on the physical 
or functional properties of the specific TCAP in in vitro 
assay systems, e.g., kinase activity, binding with antibodies 
directed to the specific TCAP, or inhibition of cell func- 
tions) performed, facilitated or affected by the specific 
TCAP. 

[0109] Once a particular recombinant DNA molecule is 
identified and isolated, several methods known in the art 
may be used to propagate it. Once a suitable host system and 
growth conditions are established, recombinant expression 
vectors can be propagated and prepared in quantity. As 
previously explained, the expression vectors that can be 
used include, but are not limited to, the following vectors or 
their derivatives: human or animal viruses such as vaccinia 
virus or adenovirus; insect viruses such as baculovirus; yeast 
vectors; bacteriophage vectors (e.g., lambda), and plasmid 
and cosmid DNA vectors. 

[0110] In addition, a host cell strain may be chosen which 
modulates the expression of the inserted sequences, or 
modifies and processes the gene product in the specific 
fashion desired. Expression from certain promoters can be 
elevated in the presence of certain inducers; thus, expression 
of the genetically engineered TCAP may be controlled. 
Furthermore, different host cells have characteristic and 
specific mechanisms for the translational and post-transla- 
tional processing and modification (e.g., glycosylation, 
phosphorylation of proteins. Appropriate cell lines or host 
systems can be chosen to ensure the desired modification 
and processing of the foreign protein expressed. 

[0111] For example, expression in a bacterial system can 
be used to produce an unglycosylated core protein product. 
Expression in yeast will produce a glycosylated product. 
Expression in mammalian cells can be used to ensure 
"native" glycosylation of a heterologous protein. Further- 
more, different vector/host expression systems may affect 
processing reactions to different degrees. 

[0112] In other specific embodiments, the specific TCAP, 
or fragment or derivative thereof, may be expressed as a 
fusion, or chimeric protein product, comprising the protein, 
fragment or derivative joined via a peptide bond to a protein 
sequence derived from a different protein. Such a chimeric 
product can be made by ligating the appropriate nucleic acid 
sequences encoding the desired amino acid sequences to 
each other by methods known in the art, in the proper coding 
frame, and expressing the chimeric product by methods 
commonly known in the art. In one embodiment, therefore, 
the invention includes an isolated nucleic acid comprising a 
sequence of at least 10 nucleotides encoding a chimeric 
TCAP, wherein the chimeric TCAP displays at least one of 
the functional activities of the wild- type TCAP, and at least 



one non-TCAP functional activity. Alternatively, such a 
chimeric product may be made by protein synthetic tech- 
niques, e.g., by use of a peptide synthesizer. 

[0113] A person of skill in the art will appreciate that 
cDNA, genomic, and synthesized sequences can be cloned 
and expressed. One way to accomplish such expression is by 
transferring a TCAP gene, or a nucleic acid encoding a 
TCAP or fragment thereof, to cells in tissue culture. The 
expression of the transferred gene may be controlled by its 
native promoter, or can be controlled by a non-native 
promoter (see supra; Section 5.7.3.1, infra). In addition to 
transferring a nucleic acid comprising a nucleic acid 
sequence encoding an entire TCAP (i.e., equivalent to the 
wild type), the transferred nucleic acids can encode a 
functional portion of a particular TCAP, or a protein having 
at least 60% sequence identity to a TCAP disclosed herein, 
as compared over the length of the particular TCAP, or a 
polypeptide having at least 60% sequence similarity to a 
TCAP fragment, as compared over the length of the TCAP 
fragment. Introduction of the nucleic acid into the cell is 
accomplished by such methods as electroporation, lipofec- 
tion, calcium phosphate mediated transfection, or viral 
infection. Usually, the method of transfer includes the trans- 
fer of a selectable marker to the cells. The cells are then 
placed under selection to isolate those cells that have taken 
up and are expressing the transferred gene. The expressed 
TCAPs or fragments thereof are isolated and purified as 
described below. 

5.3. IDENTIFICATION AND PURIFICATION OF 
TCAP GENE PRODUCTS 

[0114] In particular aspects, the invention provides amino 
acid sequences of TCAPs, preferably human TCAPs, and 
fragments and derivatives thereof which comprise an anti- 
genic determinant (i.e., can be recognized by an antibody) or 
which are otherwise functionally active, as well as nucleic 
acid sequences encoding the foregoing. "Functionally 
active" TCAP material as used herein refers to that material 
displaying one or more known functional activities associ- 
ated with a full-length (wild- type) TCAP, e.g., activities 
associated with G-coupled proteins (TA-GPCR), GTPase- 
inducing activity (TA-GAP), transcriptional activation activ- 
ity (TA-NFKBH), protease activity (TA-PP2C) or transdu- 
cin-like activity (i.e., the ability to transmit a signal between 
a GPCR and an effector protein (TA-WDRP); inhibition of 
these activities; binding to a substrate or binding partner of 
the proteins listed above; or antigenicity (binding to an 
antibody raised against one of these proteins), immunoge- 
nicity, and so forth. 

[0115] In specific embodiments, the invention provides 
fragments of TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, 
TA-PP2C, TA-WDRP or TA-LRRP consisting of at least 6 
amino acids, 10 amino acids, 50 amino acids, or of at least 
75 amino acids. In other embodiments, the proteins com- 
prise or consist essentially of an extracellular ligand-binding 
domain, transmembrane domain, or intracellular domain 
(TA-GPCR); Kelch repeats (TA-KRP); WD repeat domain 
(TA-WDRP); (3-propeller (TA-KRP or TA-WDRP); GTP- 
binding domain (TA-GPCR; TA-GAP); rhoGAP domain 
(TA-GAP); Ankyrin repeat-containing domain (TA- 
NFKBH), leucine repeat-rich domain (TA-LLRP), POZ/ 
BTB domain (TA-KRP), PP2C box (TA-PP2C), or any 
combination of the foregoing, of the above TCAPs. Frag- 
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ments, or proteins comprising fragments, lacking some or all 
of the foregoing regions of the above TCAPs are also 
provided. Nucleic acids encoding the foregoing are also 
provided. 

[0116] Once a recombinant that expresses the TCAP- 
encoding gene sequence, or part thereof, is identified, the 
resulting product can be analyzed. This analysis is achieved 
by assays based on the physical or functional properties of 
the product, including radioactive labeling of the product 
followed by analysis by gel electrophoresis, immunoassay, 
etc. 

[0117] Once the particular TCAP, or fragment thereof, is 
identified, it may be isolated and purified by standard 
methods including chromatography (e.g., ion exchange, 
affinity, and sizing column chromatography), centrifugation, 
differential solubility, or by any other standard technique for 
the purification of proteins. The functional properties may be 
evaluated using any suitable assay (see Section 5.7). 

[0118] Alternatively, once a TCAP produced by a recom- 
binant is identified, the amino acid sequence of the protein 
can be deduced from the nucleotide sequence of the chi- 
meric gene contained in the recombinant. As a result, the 
protein can be synthesized by standard chemical methods 
known in the art (e.g., see Hunkapiller et al., Nature 
310:105-111 (1984)). 

[0119] In another alternate embodiment, native 
TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, TA-PP2C, TA- 
WDRP or TA-LRRP proteins can be purified from natural 
sources, by standard methods such as those described above 
(e.g., immuno affinity purification). 

[0120] In a specific embodiment of the present invention, 
such TCAPs, whether produced by recombinant DNA tech- 
niques or by chemical synthetic methods or by purification 
of native proteins, include but are not limited to those 
containing, as a primary amino acid sequence, all or part of 
the amino acid sequence substantially as depicted in FIGS. 
1A-1E (SEQ ID NOS: 3), FIGS. 2A-2D (SEQ ID NO: 4), 
FIGS. 3A-3D and 4A-4C (SEQ ID NO: 7), FIGS. 5A-5G 
(SEQ ID NO: 9), FIGS. 6A-6C (SEQ ID NOS: 11), FIGS. 
7A-7D (SEQ ID NO: 13), FIGS. 8A-8F (SEQ ID NO: 15), 
FIGS. 9A-9H (SEQ ID NO: 17) and FIGS. 10A-10E (SEQ 
ID NO: 19), as well as fragments and other derivatives 
thereof, including proteins homologous thereto. 

5.4. STRUCTURE OF TCAP-ENCODING GENES 
AND ENCODED PROTEINS 

[0121] The structure of the genes encoding TCAPs, and 
the encoded TCAPs, can be analyzed by various methods 
known in the art, as described in the following sections. 

5.4.1. GENETIC ANALYSIS 

[0122] The cloned DNA or cDNA corresponding to a 
TCAP-encoding gene can be analyzed by methods includ- 
ing, but not limited to. Southern hybridization (Southern, E. 
M., J. Mol. Biol. 98:503-517 (1975)), northern hybridization 
(see e.g., Freeman et al., Proc. Natl. Acad. Sci. U.S.A. 
80:4094-4098 (1983)), restriction endonuclease mapping 
(Maniatis, T, Molecular Cloning, A Laboratory, Cold Spring 
Harbor, N.Y. (1982)), and DNA sequence analysis. Poly- 
merase chain reaction (PCR; U.S. Pat. Nos. 4,683,202, 
4,683,195 and 4,889,818; Gyllenstein et al., Proc. Natl. 



Acad. Sci. U.S.A. 85:7652-7656 (1988); Ochman et al., 
Genetics 120:621-623 (1988); Loh et al., Science 243:217- 
220 (1989)) followed by Southern hybridization with a 
probe specific to one of the TCAP-encoding genes can allow 
the detection of that particular TCAP-encoding gene in DNA 
from various cell types from various vertebrate sources. 
Methods of amplification other than PCR are commonly 
known and can also be employed. In one embodiment. 
Southern hybridization can be used to determine the genetic 
linkage of a particular TCAP gene. Northern hybridization 
analysis can be used to determine the expression of a 
particular TCAP gene. Various cell types, at various states of 
development or activity can be tested for expression of a 
particular TCAP gene. In one preferred embodiment, screen- 
ing arrays comprising probes homologous to the exons of 
particular TCAP-encoding genes are used to determine the 
state of expression of these genes, or specific exons of these 
genes, in various cell types, under particular environmental 
or perturbance conditions, or in various vertebrates. The 
stringency of the hybridization conditions for both Southern 
and northern hybridization can be manipulated to ensure 
detection of nucleic acids with the desired degree of relat- 
edness to the specific probe used. Modifications of these 
methods and other methods commonly known in the art can 
be used. 

[0123] Restriction endonuclease mapping can be used to 
roughly determine the genetic structure of a TCAP gene. 
Restriction maps derived by restriction endonuclease cleav- 
age can be confirmed by DNA sequence analysis. The 
genetic structure of a TCAP gene can also be determined 
using scanning oligonucleotide arrays, wherein the expres- 
sion of one exon is correlated with the expression of a 
plurality of neighboring exons, such that the correlation 
indicates the correlated exons are contained within the same 
gene. The structure so determined can be confirmed by PCR. 

[0124] DNA sequence analysis can be performed by any 
techniques known in the art, including but not limited to the 
method of Maxam and Gilbert, Meth. Enzymol. 65:499- 
5601 (1980), the Sanger dideoxy method (Sanger, F., et al., 
Proc. Natl. Acad. Sci. U.S.A. 74:5463 (1977)), the use of T7 
DNA polymerase (Tabor and Richardson, U.S. Pat. No. 
4,795,699), or use of an automated DNA Sequenator (e.g., 
Applied Biosystems, Foster City, Calif.). The sequencing 
method may use radioactive or fluorescent labels. 

5.4.2. PROTEIN ANALYSIS 

[0125] The amino acid sequence of a particular TCAP can 
be derived by deduction from the DNA sequence, or alter- 
natively, by direct sequencing of the protein, e.g., with an 
automated amino acid sequencer. 

[0126] The protein sequence of a TCAP can be character- 
ized by a hydrophilicity analysis (Hopp and Woods, Proc. 
Natl. Acad. Sci. U.S.A. 78:3824 (1981)). A hydrophilicity 
profile is used to identify the hydrophobic and hydrophilic 
regions of a TCAP and the corresponding regions of the gene 
sequence which encode such regions. 

[0127] Secondary structural analysis (Chou and Fasman, 
Biochemistry 13:222 (1974)) can also be done, to identify 
regions of particular TCAPs that assume specific secondary 
structures, such as a-helices, (3 -pleated sheets or turns. 

[0128] Manipulation, translation, secondary structure pre- 
diction, open reading frame prediction and plotting, as well 



US 2003/01 19024 Al 



15 



Jun. 26, 2003 



as determination of sequence homologies, can also be 
accomplished using computer software programs and nucle- 
otide and protein sequence databases available in the art. 
Protein and/or nucleotide sequence homologies to known 
proteins or DNA sequences can be used to deduce the likely 
function of a particular TCAP, or domains thereof. 

[0129] Other methods of structural analysis can also be 
employed. These include but are not limited to X-ray 
crystallography (Engstom, Biochem. Exp. Biol. 11:7-13 
(1974)) and computer modeling (Fletterick, and Zoller, 
(eds.), Computer Graphics and Molecular Modeling, in 
Current Communications in Molecular Biology, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, N.Y (1986)). 

[0130] In addition to determinations of the TCAP protein 
structure, the invention provides method of identifying a 
molecule that specifically binds to a ligand selected from the 
group consisting of a TCAP, a fragment of a TCAP com- 
prising a domain of the TCAP, and a nucleic acid encoding 
the TCAP or fragment thereof, comprising (a) contacting 
said ligand with a plurality of molecules under conditions 
conducive to binding between said ligand and the molecules; 
and (b) identifying a molecule within said plurality that 
specifically binds to said ligand. 

5.5. GENERATION OF ANTIBODIES TO TCAPS 
AND DERIVATIVES THEREOF 
[0131] According to the invention, a TCAP, its fragments, 
or other derivatives thereof may be used as an immunogen 
to generate antibodies which immunospecifically bind such 
an immunogen. Such antibodies include but are not limited 
to polyclonal, monoclonal, chimeric and single chain anti- 
bodies, as well as Fab fragments and an Fab expression 
library. In a specific embodiment, antibodies to human 
TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, TA-PP2C, TA- 
WDRP or TA-LRRP are produced. In another embodiment, 
antibodies to a domain of a particular TCAP are produced. 
In a specific embodiment, fragments of a TCAP protein 
identified as hydrophilic are used as immunogens for anti- 
body production. 

[0132] Various procedures known in the art may be used 
for the production of polyclonal antibodies to a specific 
TCAP, or derivative thereof In a particular embodiment, 
rabbit polyclonal antibodies to an epitope of a TCAP 
encoded by a sequence of SEQ ID NO: 1, SEQ ID NO: 2, 
SEQ ID NO: 5, SEQ ID NO:6, SEQ ID NO: 8, SEQ ID NO: 
10, SEQ ID NO: 12, SEQ ID NO: 14, SEQ ID NO: 16 or 
SEQ ID NO: 18 or a subsequence thereof, can be obtained. 
For the production of antibody, various host animals can be 
immunized by injection with native TCAP, or a synthetic 
version or derivative (e.g., fragment) thereof, including, but 
not limited to, rabbits, mice, rats, goats, bovines or horses. 
Various adjuvants may be used to increase the immunologi- 
cal response, depending on the host species. Adjuvants that 
may be used according to the present invention include, but 
are not limited to, Freund's (complete and incomplete), 
mineral gels such as aluminum hydroxide, surface active 
substances such as lysolecithin, pluronic polyols, polyan- 
ions, peptides, oil emulsions, keyhole limpet hemocyanins, 
dinitrophenol, and potentially useful human adjuvants such 
as BCG (bacille Calmette-Guerin) and corynebacterium 
parvum. 

[0133] For preparation of monoclonal antibodies directed 
toward a TCAP sequence or derivative thereof, any tech- 



nique which provides for the production of antibody mol- 
ecules by continuous cell lines in culture may be used. For 
example, monoclonal antibodies may be prepared by the 
hybridoma technique originally developed by Kohler and 
Milstein, Nature 256:495-497 (1975), as well as the trioma 
technique, the human B-cell hybridoma technique (Kozbor 
et al, Immunol. Today 4:72 (1983)), or the EBV-hybridoma 
technique (Cole et al, in MONOCLONAL ANTIBODIES 
AND CANCER THERAPY, Alan R. Liss, Inc., pp. 77-96 
(1985)). In an additional embodiment of the invention, 
monoclonal antibodies can be produced in germ-free ani- 
mals utilizing recent technology (PCT/US 90/02545). 
According to the invention, human antibodies may be used 
and can be obtained by using human hybridomas (Cote et al, 
Proc. Natl. Acad. Sci. U.S.A., 80:2026-2030 (1983)) or by 
transforming human B cells with EBV virus in vitro (Cole 
et al, in MONOCLONAL ANTIBODIES AND CANCER 
THERAPY, Alan R. Liss, pp. 77-96 (1985)). Furthermore, 
according to the invention, techniques developed for the 
production of "chimeric antibodies" (Morrison et al, Proc. 
Natl. Acad. Sci. U.S.A. 81:6851-6855 (1984); Neuberger et 
al, Nature 312:604-608 (1984); Takeda et al., Nature 
314:452-454 (1985)) can be used, wherein genes from a 
mouse antibody molecule specific to a particular TCAP are 
spliced to genes encoding a human antibody molecule of 
appropriate biological activity can be used; such antibodies 
are within the scope of this invention. 

[0134] According to the invention, techniques described 
for the production of single chain antibodies (U.S. Pat. No. 
4,946,778) can be adapted to produce single chain antibod- 
ies specific to a particular TCAP. An additional embodiment 
of the invention utilizes the techniques described for the 
construction of Fab expression libraries (Huse et al, Science 
246:1275-1281 (1988)) to allow rapid and easy identifica- 
tion of monoclonal Fab fragments with the desired speci- 
ficity for particular TCAPs or derivatives thereof. Antibody 
fragments which contain the idiotype of the molecule can be 
generated by known techniques. For example, such frag- 
ments include but are not limited to: the F(ab'), fragment 
which can be produced by pepsin digestion of the antibody 
molecule; the Fab' fragments which can be generated by 
reducing the disulfide bridges of the F(ab'), fragment, the 
Fab fragments which can be generated by treating the 
antibody molecule with papain and a reducing agent, and Fv 
fragments. 

[0135] In the production of antibodies, screening for the 
desired antibody can be accomplished by techniques known 
in the art, e.g. ELISA (enzyme -linked immunosorbent 
assay), RIA (radioimmunoassay) or RIBA (recombinant 
immunoblot assay). For example, to select antibodies which 
recognize a specific domain of a TCAP, one may assay 
generated hybridomas for a product which binds to a TCAP 
fragment containing such domain. For selection of an anti- 
body that specifically binds a first TCAP homolog but which 
does not specifically bind a second, different TCAP 
homolog, one can select on the basis of positive binding to 
the first TCAP homolog and a lack of binding to the second 
TCAP homolog. 

[0136] Antibodies specific to a domain of a TCAP are also 
provided. The foregoing antibodies can be used in methods 
known in the art relating to the localization and activity of 
the TCAP sequences of the invention, e.g., for imaging these 
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proteins, measuring levels thereof in appropriate physiologi- 
cal samples, in diagnostic methods, etc. 

[0137] In another embodiment of the invention, antibodies 
to particular TCAPs, and antibody fragments thereof con- 
taining the binding domain are therapeutics (see infra). In a 
preferred embodiment, the antibodies are isolated or puri- 
fied. 

5.6. TCAPS AND TCAP DERIVATIVES 

[0138] The invention further relates to specific TCAPs and 
derivatives (including but not limited to fragments) of these 
specific TCAPs (e.g.,TA-GPCR, TA-GAP, TA-NFKBH, TA- 
KRP, TA-PP2C, TA-WDRP or TA-LRRP). Nucleic acids 
encoding derivatives and protein of these TCAPs are also 
provided. In one embodiment, specific TCAPs are encoded 
by the associated TCAP nucleic acids described in Section 
5.1 supra. 

[0139] The production and use of derivatives produced 
through modification of TCAP-encoding genes are within 
the scope of the present invention. In a specific embodiment, 
the derivative is functionally active, i.e., capable of exhib- 
iting one or more functional activities associated with a 
full-length, wild-type TCAP. As one example, such deriva- 
tives that have the desired immunogenicity or antigenicity 
can be used, for example, in immunoassays, for immuniza- 
tion, for inhibition of the activity of a specific TCAP, etc. As 
another example, such derivatives that substantially have the 
desired TCAP activity, or which are phosphorylated or 
dephosphorylated, are provided. Derivatives that retain, or 
alternatively lack or inhibit, a desired TCAP property of 
interest, a specific activity, such as activity associated with 
G-coupled proteins (TA-GPCR), GTPase -inducing activity 
(TA-GAP), transcriptional activation activity (TA-NFKBH), 
protease activity (TA-PP2C) or G-protein activity (TA- 
WDRP); inhibition of these activities), can be used as 
inducers, or inhibitors, respectively, of such a property and 
its physiological correlates. A specific embodiment relates to 
a TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, TA-PP2C, 
TA-WDRP of TA-LRRP fragment that can be bound by an 
antibody directed to the corresponding native TCAP. 
Derivatives of particular TCAPs can be tested for the desired 
activity by procedures known in the art, including but not 
limited to the assays described in Section 5.7. 

[0140] In particular, derivatives of TCAPs can be made by 
altering the nucleotide sequences encoding them by substi- 
tutions, additions or deletions that provide for functionally 
equivalent protein molecules. In a specific embodiment, the 
alteration is made in a nucleic acid sequence encoding all or 
part of TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, 
TA-PP2C, TA-WDRP or TA-LRRP. Due to the degeneracy 
of nucleotide coding sequences, other DNA sequences that 
encode substantially the same amino acid sequence as a 
TCAP-encoding gene may be used in the practice of the 
present invention. These include but are not limited to 
nucleotide sequences comprising all or portions of TCAP- 
encoding genes that are altered by the substitution of dif- 
ferent codons that encode the same amino acid residue 
within the sequence, thus producing a silent change. 

[0141] Likewise, the TCAP derivatives of the invention 
include, but are not limited to, those containing, as a primary 
amino acid sequence, all or part of the amino acid sequence 
of a TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, TA-PP2C, 



TA-WDRP or TA-LRRP protein, including altered 
sequences in which functionally equivalent amino acid 
residues are substituted for residues within the sequence 
resulting in a silent or insubstantial change. For example, 
one or more amino acid residues within the sequence can be 
substituted by another amino acid of a similar polarity which 
acts as a functional equivalent, resulting in a silent alter- 
ation. Substitutes for an amino acid within the sequence may 
be selected from other members of the class to which the 
amino acid belongs. For example, the nonpolar (hydropho- 
bic) amino acids include alanine, leucine, isoleucine, valine, 
proline, phenylalanine, tryptophan and methionine. The 
polar neutral amino acids include glycine, serine, threonine, 
cysteine, tyrosine, asparagine, and glutamine. The positively 
charged (basic) amino acids include arginine, lysine and 
histidine. The negatively charged (acidic) amino acids 
include asp ar tic acid and glutamic acid. 

[0142] In a specific embodiment of the invention, proteins 
consisting of or comprising a fragment of a particular TCAP 
consisting of at least 10 (continuous) amino acids of that 
TCAP protein is provided. In other embodiments, the frag- 
ment consists of at least 20 or 50 amino acids of a particular 
TCAP. In specific embodiments, such fragments are not 
larger than 35, 100 or 200 amino acids. Derivatives of 
TCAPs include but are not limited to those molecules 
comprising regions that are homologous to TA-GPCR, TA- 
GAP, TA-NFKBH, TA-KRP, TA-PP2C, TA-WDRP or TA- 
LRRP or fragments thereof. "Homologous" means that in 
various embodiments, two amino acid sequences share 
preferably at least 60% or 70%, more preferably at least 80% 
or 90%, and even more preferably at least 95% sequence 
identity over an amino acid sequence of identical size. When 
the alignment is done by a computer homology program 
known in the art, such as BLAST (blastp), the percent 
homology is calculated by dividing the number of amino 
acids in the TCAP sequence or fragment thereof into the 
number of amino acids of the TCAP sequence exactly 
matching the amino acid at the same position in the second 
sequence, where introduced gaps count as a mismatch, and 
where conservative changes count as a match. A BLAST 
comparison can also determine the "sequence similarity" 
between two proteins, where sequence similarity is defined 
as a positive score in a BLOSLUM62 scoring matrix com- 
parison of the two sequences. 

[0143] Derivatives of TCAPs also include molecules 
whose encoding nucleic acid is capable of hybridizing to a 
TCAP-encoding sequence, under stringent, moderately 
stringent, or nonstringent conditions. 

[0144] The TCAP derivatives of the invention can be 
produced by various methods known in the art. The manipu- 
lations which result in their production can occur at the gene 
or protein level. For example, the cloned gene sequence of 
a TCAP gene can be modified by any of numerous strategies 
known in the art (Maniatis, Molecular Cloning, A Labora- 
tory Manual, 2d. ed., Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y. (1990)). The sequence can be cleaved 
at appropriate sites with restriction endonuclease(s), fol- 
lowed by further enzymatic modification if desired, then 
isolated and ligated in vitro. In the production of a gene 
encoding a derivative of a TCAP, care should be taken to 
ensure that the modified gene remains within the same 
translational reading frame as the TCAP gene, uninterrupted 



US 2003/01 19024 Al 



17 



Jun. 26, 2003 



by translational stop signals, in the gene region where the 
desired TCAP activity is encoded. 

[0145] Additionally, a TCAP-encoding nucleic acid 
sequence can be mutated in vitro or in vivo to create and/or 
destroy translation, initiation, and/or termination sequences, 
or to create variations in coding regions and/or form new 
restriction endonuclease sites or destroy preexisting ones, to 
facilitate further in vitro modification. Any technique for 
mutagenesis known in the art can be used, including but not 
limited to, chemical mutagenesis, in vitro site-directed 
mutagenesis (Hutchinson, et al., J. Biol. Chem. 
253:6551(1978)), use of TAB linkers (Pharmacia), PGR 
using mutagenizing primers, and so forth. 

[0146] Manipulations of a TCAP sequence may also be 
made at the protein level. Included within the scope of the 
invention are TCAP fragments or other derivatives which 
are differentially modified during or after translation, e.g., by 
glycosylation, acetylation, phosphorylation, amidation, 
derivatization by known protectingA^locking groups, pro- 
teolytic cleavage, or linkage to an antibody molecule or 
other cellular ligand. Any of numerous chemical modifica- 
tions may be carried out by known techniques, including, 
but not limited to, specific chemical cleavage by cyanogen 
bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH 4 ; acetylation, formylation, oxidation, reduction; 
metabolic synthesis in the presence of tunicamycin; and so 
forth. 

[0147] In addition, derivatives of a TCAP can be chemi- 
cally synthesized. For example, a peptide corresponding to 
a portion of a TCAP that comprises a desired domain (see 
Section 5.6.1), or which mediates the desired activity in 
vitro, can be synthesized by use of a peptide synthesizer. 
Furthermore, if desired, nonclassical amino acids or chemi- 
cal amino acid analogs can be introduced as a substitution or 
addition into the particular TCAP sequence. Non-classical 
amino acids include, but are not limited, to the D -isomers of 
the common amino acids, a-amino isobutyric acid, 4-ami- 
nobutyric acid, Abu, 2-amino butyric acid, y-Abu, e-Ahx, 
6 -amino hexanoic acid, Aib, 2-amino isobutyric acid, 
3 -amino propionic acid, ornithine, norleucine, norvaline, 
hydroxyproline, sarcosine, citrulline, cysteic acid, t-butylg- 
lycine, t-butylalanine, phenylglycine, cyclohexylalanine, 
p -alanine, fluoro-amino acids, designer amino acids such as 
(3 -methyl amino acids, Ca-methyl amino acids, Na-methyl 
amino acids, and amino acid analogs in general. Further- 
more, the amino acid can be D (dextrorotary) or L (levoro- 
tary). 

[0148] In a specific embodiment, the derivative of a par- 
ticular TCAP is a chimeric, or fusion, protein comprising a 
TCAP protein or fragment thereof, preferably consisting of 
at least a domain or motif of the particular TCAP, or at least 
6 amino acids of the particular TCAP, joined at its amino- or 
carboxy-terminus via a peptide bond to an amino acid 
sequence of a different protein. In one embodiment, such a 
chimeric protein is produced by recombinant expression of 
a nucleic acid encoding the protein, comprising a TCAP- 
coding sequence joined in-frame to a coding sequence for a 
different protein. Such a chimeric product can be made by 
ligating the appropriate nucleic acid sequences encoding the 
desired amino acid sequences to each other by methods 
known in the art, in the proper coding frame, and expressing 
the chimeric product by methods commonly known in the 



art. Alternatively, such a chimeric product may be made by 
protein synthetic techniques, e.g. by use of a peptide syn- 
thesizer. Chimeric genes comprising portions of a TCAP 
gene, fused to any heterologous protein-encoding 
sequences, may be constructed. A specific embodiment 
relates to a chimeric protein comprising a fragment of a 
particular TCAP of at least six amino acids. 

[0149] Other specific embodiments of derivatives are 
described in the subsection below and examples sections 
infra. 

5.6.1. DERIVATIVES OF PARTICULAR TCAPS 
CONTAINING ONE OR MORE DOMAINS OF 
THE PROTEIN 

[0150] In a specific embodiment, the invention relates to 
TCAP derivatives, in particular derivatives of TA-GAP, 
TA-GPCR, TA-PP2C, TA-NFKBH, TA-KRP, TA-WDRP, or 
TA-LRRP, including TA-GAP RhoGAP domain, TA-GPCR 
extracellular, transmembrane, intracellular or GTP-binding 
domains, TA-WDRP GPCR-binding or WD motif-contain- 
ing domain, TA-WDRP or TA-KRP p propeller domains 
kelch repeats and POZ/BTB domain; TA-NFKBH ankyrin 
repeats and DNA-binding domains and TA-LRRP trans- 
membrane domains and leucine -rich repeat domains; and 
fragments and derivatives of such fragments, that comprise, 
or alternatively consist of, one or more domains of a TCAP 
including but not limited to a functional (e.g., binding) 
fragment of any of the foregoing, or any combination of the 
foregoing TCAPs. 

[0151] In another specific embodiment, a molecule is 
provided that comprises one or more domains (or functional 
portion thereof) of a particular TCAP protein but that also 
lacks one or more domains (or functional portion thereof) of 
that particular TCAP. In particular examples, TA-GPCR 
derivatives are provided that lack an intracellular, GTP- 
binding, or transmembrane domain. By way of another 
example, such a TA-GPCR may also lack all or a portion of 
the extracellular domain, but retain at least the transmem- 
brane or intracellular domains of a TA-GPCR. In another 
embodiment, a molecule is provided that comprises one or 
more domains (or functional portion thereof) of a TCAP and 
that has one or more mutant (e.g., due to deletion or point 
mutation(s)) domains of a TCAP such that the mutant 
domain has increased or decreased function. By way of 
example, the TA-GPCR extracellular domain may be 
mutated so as to have reduced, absent, or increased ligand- 
binding activity. A person of skill in the art would under- 
stand that fragments comprising one or more domains, or 
one or more mutant domains, may be derived from other 
TCAPs, as well. In a specific embodiment, one, two, or three 
point mutations are present. 

5.7. UTILITY 

[0152] The invention provides TCAPs having useful 
activities. The invention further provides the use of TCAPs 
or derivatives thereof, TCAP nucleic acids, and antibodies 
that recognize TCAPs, or derivatives thereof, as markers for 
the activation, or lack thereof, of T cells. Such markers 
enable the screening for diagnosis, staging and monitoring 
of therapies of diseases and disorders associated with unde- 
sirable T cell activation, or, alternatively, where T cell 
insufficient T cell activation has occurred. For example, the 
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invention provides monitoring of therapies directed to the 
suppression of inappropriate or undesired T cell activation, 
or of therapies directed to the enhancement of T cell 
activation, where such activation is desired. Finally, the 
invention provides for the use of TCAPs or derivatives 
thereof, TCAP nucleic acids, or antibodies that recognize 
TCAPs, or derivatives thereof, as therapeutic agents for the 
treatment of conditions related to T cell activation or lack 
thereof. 

5.7.1. USEFUL ACTIVITIES ASSOCIATED 
WITH TCAPs 

[0153] The TCAPs of the present invention have activities 
useful in their own right. These activities may be used in 
vitro to accomplish desired reactions. They may also be used 
as part of in vitro models of the particular biochemical 
system of which they are a part. Each may also be used as 
a target for immunomodulatory drugs, wherein the immu- 
nomodulatory drug enhances or, more generally, represses, 
the activity of a particular TCAP. Such immonoregulatory 
effect is established either directly by showing an effect on 
T cell activation when applied to model T cells, for example, 
Jurkat T cells, or indirectly by showing a modulation of the 
transcription of one or more TCAP genes, or of the activity 
of one or more TCAPs. The utility of each TCAP described 
herein is discussed in more detail below. 

5.7.1.1. TA-GAP 

[0154] GAPs have the intrinsic activity of stimulating the 
GTPase activity of GTPases. This activity is useful in assays 
of GTPase activity, particularly Rho GTPase activity, on G 
protein-mediated signaling pathways. Assays for Rho GAP 
activity have been described (Toure et al., J. Biol. Chem. 
273(ll):6019-6023 (1998); Ross & Wilkie, Ann. Rev. Bio- 
chem. 69:795-827 (2000)). Furthermore, because of the 
control exerted by GTPases, and therefore, by GAPs, over 
cell growth and proliferation, GAPs are also natural targets 
for drug discovery. Several GAPs have been described as 
useful in the diagnosis and treatment of cancers. See Weiss- 
bach et al., U.S. Pat. No. 5,639,651; Wong et al., U.S. Pat. 
No. 5,760,203. Thus, TA-GAP is highly likely to be useful 
not only for its intrinsic GTPase -regulating activity, but as a 
target for drugs directed to the suppression of T cell acti- 
vation and proliferation. 

5.7.1.2. TA-GPCR 

[0155] The useful activity of a GPCR is its ability to 
transmit extracellular signals to the interior of the cell. As a 
consequence of relatively small ligand-binding sites and the 
wide range of physiological events which they regulate, 
GPCRs have well-known utility as targets for drugs; in fact, 
GPCRs constitute the largest class of drug targets in humans 
(Flower, Biochim. et Biophys. Acta. 1422:207-234 (1999)). 
In fact, existing studies of GPCRs have established a pattern 
for drug discovery that any new drug discovery project 
might reasonably follow. When the sequence for a new 
GPCR is determined, comparison of the sequence to existing 
GPCRs with known functions enables one to determine the 
broad features of the binding site, which, in turn, suggests 
the types of compounds that may be made or selected from 
a compound bank or commercial database to interact with 
that binding site. See Flower, supra. Thus, TA-GPCR is 
useful as a target for drug studies, where the drug in question 



is to modulate T cell activation. A number of GPCRs have 
been described as useful in a variety of diagnostic and/or 
therapeutic applications. See, e.g., MacLennan, U.S. Pat. 
No. 5,585,476; Soppet et al., U.S. Pat. No. 5,756,309; 
Soppet et al., U.S. Pat. No. 5,776,729. Methods for assaying 
for GPCR activity have been described previously (Sadee, 
U.S. Pat. No. 5,882,944; Barak et al., U.S. Pat. No. 5,891, 
646). 

5.7.1.3. TA-WDRP 

[0156] G proteins function to transmit signals received by 
GPCRs to enzymes that create effector molecules, such as 
cAMP, inositol triphosphate, and phospholipase C. The 
useful activity of G proteins thus lies in their place in signal 
transduction, and on this basis, like GPCRs, they have been 
drug targets. See, e.g., Doll et al., U.S. Pat. No. 6,214,828 
(describing compounds directed to G proteins useful in 
reducing cell proliferation). 

5.7.1.4. TA-NFKBH 

[0157] The useful activity of TA-NFKBH is its ability to 
promote the transcription of genes. Thus, TA-NFKBH rep- 
resents another potential target for drug therapies directed to 
the modulation of T cell activation. As noted in Section 2.5, 
the inappropriate regulation of NF-kB and its dependent 
genes has been associated with septic shock, graft-versus- 
host disease, acute inflammatory conditions, acute phase 
response, transplant rejection, autoimmune diseases, and 
cancer (Manna & Agarwal, J. Immunol. 165:2095-2102 
(1999)); as TA-NFKBH is produced during T cell activation, 
it is highly likely that the genes whose transcription is 
promoted by TA-NFKBH are similarly involved in these 
conditions. NF-kB has also been described as an attractive 
and highly useful target for therapies directed to these 
conditions, including small molecule or antisense inhibition. 
See, e.g., Narayanan et al., U.S. Pat. No. 5,591,840. For 
example, one agent, known as A77 1726, exhibits antiin- 
flammatory, antiproliferative and immunosuppressive 
effects by blocking TNF-dpendent NF-kB activation and 
gene expression (Manna & Agarwal, above). Based on the 
sequence homology of TA-NFKBH to NF-kB, it is likely 
that TA-NFKBH is similarly useful as a target for antiin- 
flammatory and immunosuppressive drugs. 

5.7.1.5. TA-PP2C 

[0158] Based on sequence homologies, TA-PP2C is a class 
2C phosphatase (a PP2C) and possesses serine/threonine 
phosphatase activity, that is, the ability to remove phos- 
phates from serine or threonine residues. This activity is 
useful in any assay that involves the kinasing of serine or 
threonine residues, to reverse the kinasing reaction. Assays 
for PP2Cs have been described (Cheng et al., J. Biol. Chem. 
274(44): 34733-34749 (2000); Takekawa et al., EMBO J. 
17:4744-4752 (1998)). Thus, TA-PP2C has utility for its 
intrinsic enzymatic activity. Moreover, TA-PP2C can be 
used to identify inhibitors of serine/threonine phosphatase 
activity in vitro; such assays have been described (Mat- 
suzawa et al., FEBS Lett. 19:356(2-3):272-4 (1994). 

5.7.1.6. ASSAYS OF TCAPS AND TCAP 
DERIVATIVES 

[0159] In addition to the specific assays referenced above, 
the functional activity of TCAPs, derivatives can be assayed 
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by various other methods. For example, in one embodiment, 
where one is assaying for the ability to bind or compete with 
the wild-type of a particular TCAP for binding to an anti- 
body raised against the protein, various immunoassays 
known in the art can be used, including but not limited to 
competitive and non-competitive assay systems using tech- 
niques such as radioimmunoassays, ELISA (enzyme linked 
immunosorbent assay), "sandwich" immunoassays, immu- 
noradiometric assays, gel diffusion precipitin reactions, 
immunodiffusion assays, in situ immunoassays (using col- 
loidal gold, enzyme or radioisotope labels, for example), 
western blots, precipitation reactions, agglutination assays 
(e.g., gel agglutination assays, hemagglutination assays), 
complement fixation assays, immunofluorescence assays, 
protein A assays, and immunoelectrophoresis assays, etc. In 
one embodiment, antibody binding is detected by detecting 
a label on the primary antibody. In another embodiment, the 
primary antibody is detected by detecting binding of a 
secondary antibody or reagent to the primary antibody. In a 
further embodiment, the secondary antibody is labeled. 
Many means are known in the art for detecting binding in an 
immunoassay and are within the scope of the present inven- 
tion. 

[0160] In another embodiment, in those situations where a 
TCAP -binding protein is identified, the binding can be 
assayed, e.g., by means well-known in the art. In another 
embodiment, physiological correlates of the binding of a 
TCAP to its substrate(s) can be assayed. 

[0161] In another embodiment, in insect or other model 
systems, genetic studies can be done to study the phenotypic 
effect of a TCAP mutant that is a derivative of wild-type 
TCAP. 

[0162] In addition, assays that can be used to detect or 
measure the ability to inhibit, or alternatively promote the 
activities of the TCAPs described herein, are described in 
Section 5.7.4. 

[0163] Other methods will be known to the skilled artisan 
and are within the scope of the invention. 

5.7.2. TCAPS AS MARKERS OF T CELL 
ACTIVATION 

[0164] The TCAPs of the present invention are proteins 
specifically produced during T cell activation. Thus, these 
proteins, or the associated nucleic acids, in abundances 
exceeding that of the normal state (i.e., wherein T cells are 
not substantially activated), are markers of T cell activation. 
As such, they are useful markers for any condition for which 
the monitoring of the state of T cell activation is desirable. 
Thus, measuring one or more of the TCAPs or TCAP nucleic 
acids (e.g., mRNA, cDNAor cRNA) in a cell sample can be 
used to assess whether a person suffers a condition associ- 
ated with increased T cell activation, where T cell activation 
is undesirable, or lack of T cell activation, where T cell 
activation is desirable. A number of immune-related disor- 
ders or conditions, such as autoimmune disorders or severe 
combined immune disorder involve the undesirable activa- 
tion of T cells. Many physiological, pathological or thera- 
peutic conditions also involve T cell activation, such as 
bacterial, viral or organismal infections, and responses 
thereto, vaccinations and responses thereto, allergies and 
allergic reactions, immune therapies, transplants, and graft- 
versus-host disease. Conversely, some physiological, patho- 



logical or therapeutic conditions involve insufficient T cell 
activation, where T cell activation is desirable, such as 
acquired immune deficiency syndrome or chemotherapy. In 
a hospital, clinical or research setting, the ability to easily 
track the response of the immune system to various thera- 
pies, and to easily assess the immune status of a patient, 
would be a highly useful component of any course of 
treatment directly or indirectly affecting or involving the 
immune system. 

[0165] The present invention, therefore, provides markers 
of T cell activation useful for assessing the immune status of 
a person. Specifically, the invention provides for the use of 
the TCAPs TA-GPCR, TA-GAP, TA-WDRP, TA-NFKBH, 
TA-KRP,TA-WDRP and/or, TA-LRRP and the nucleic acids 
encoding them, as markers of T cell activation. These 
markers will assist in determining the efficacy of immune - 
suppressive therapies, for example, to monitor the effective- 
ness of drugs used to prevent graft-versus-host disease or of 
treatments for allergies or the suppression of the allergic 
response. The markers will also assist in monitoring the 
effectiveness of immune-promoting therapies, for example, 
certain vaccines, AIDS therapies, or SCID therapies. 

[0166] The use of TCAPs as markers is straightforward. 
First, antibodies to one or more TCAPs are raised or 
obtained according to the methods presented in Section 5.5. 
These antibodies are then used in an immunoassay to detect 
a particular TCAP, which immunoassay is carried out by a 
method comprising contacting a sample derived from a 
patient with the anti-TCAP antibody under conditions such 
that immunospecific binding can occur, and detecting or 
measuring the amount of any immunospecific binding by the 
antibody. In a specific aspect, such binding of antibody, in 
tissue sections or in patient samples, can be used to detect 
aberrant localization or aberrant (e.g., low, absent, or high) 
levels of a particular TCAP. In a specific embodiment, 
antibody(ies) to one or more TCAP can be used to assay in 
a patient tissue or serum sample for the presence of TCAP 
where an aberrant level of TCAP is an indication of a 
diseased condition. "Aberrant level" means an increased or 
decreased level relative to that present, or a standard level 
representing that present, in an analogous sample from a 
portion of the body or from a subject not having the disorder. 
In another specific embodiment, antibody(ies) to one or 
more TCAP can be used to assay in a patient tissue or serum 
sample increased or decreased levels of the TCAP(s) to 
assess the efficacy, stage, or progress of an immune system- 
promoting or immunosuppressive therapy, respectively. 

[0167] The immunoassays which can be used include but 
are not limited to competitive and non-competitive assay 
systems using techniques such as western blots, radioimmu- 
noassays, ELISA (enzyme linked immunosorbent assay), 
"sandwich" immunoassays, immunoprecipitation assays, 
precipitin reactions, gel diffusion precipitin reactions, 
immunodiffusion assays, agglutination assays, complement- 
fixation assays, immunoradiometric assays, fluorescent 
immunoassays, protein A immunoassays, etc. 

[0168] In similar fashion, mRNA encoding a particular 
TCAP can act as a marker for T cell activation, and, 
therefore, can be used in the same manner as TCAPs and 
antibodies to TCAPs. In this regard, RNA is extracted from 
a sample and is used in an assay capable of detecting the 
presence and amount of RNA present in a sample, such as 
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northern analysis, slot blots, microarray analysis, quantita- 
tive PCR, etc. TCAP-encoding nucleic acid sequences, or 
subsequences thereof comprising about at least eight (8) 
nucleotides, including complementary sequences, can be 
used as hybridization probes. Hybridization assays can be 
used to detect, prognose, diagnose, or monitor conditions, 
disorders, or disease states associated with aberrant changes 
in TCAP expression and/or activity as described supra. In 
particular, such a hybridization assay is carried out by a 
method comprising contacting a sample containing nucleic 
acid with a nucleic acid probe capable of hybridizing to 
TCAP mRNA, or nucleic acid derived therefrom, under 
conditions such that hybridization can occur, and detecting 
or measuring any resulting hybridization. In a specific 
embodiment, nucleic acids derived from TCAP mRNA, such 
as cDNAor cRNA, are measured. As used herein, cRNAis 
defined here as RNA complementary to the source RNA or 
its complement, i.e., complementary to either strand of a 
cDNA of the source RNA. The extracted RNAs are prefer- 
ably amplified using a process in which doubled-stranded 
cDNAs are synthesized from the RNAs using a primer 
linked to an RNA polymerase promoter in a direction 
capable of directing transcription of anti-sense RNA. Anti- 
sense RNAs or cRNAs are then transcribed from the second 
strand of the double-stranded cDNAs using an RNA poly- 
merase (see, e.g., U.S. Pat. Nos. 5,891,636, 5,716,785; 
5,545,522 and 6,132,997). Both oligo-dT primers (U.S. Pat. 
Nos. 5,545,522 and 6,132,997) or random primers (U.S. 
Provisional Patent Application Serial No. 60/253,641) that 
contain an RNA polymerase promoter or complement 
thereof can be used. Preferably, the target polynucleotides 
are short and/or fragmented polynucleotide molecules which 
are representative of the original nucleic acid population of 
the cell. In a most preferred embodiment, the nucleic acid 
probe is one of a plurality of different probes on a microar- 
ray. 

[0169] Collection of a sample from a patient can be by any 
means known in the art. For example, because T cells are 
blood cells, a patient sample can comprise a blood, serum, 
or plasma sample. In a specific embodiment, the sample 
comprises peripheral blood mononuclear cells (PBMCs). 
The sample may also comprise a tissue sample, drawn from 
a site of inflammation. Tissue can be biopsied or derived 
from any organ of the body affected, including bone and 
skin. Tissue can be obtained surgically or by fine needle 
aspiration. 

[0170] Typically, blood, serum, plasma or tissue samples 
from which RNA is to be extracted are quick frozen on dry 
ice. Samples are then homogenized together with a mortar 
and pestle under liquid nitrogen. A typical RNA extraction 
procedure is as follows. Total cellular RNA is extracted from 
tissue with either RNAzol™ or RNAzolB™ (Tel-Test, 
Friendswood, Tex.), according to the manufacturer's 
instructions. The tissue is solubilized in an appropriate 
amount of RNAzol™ or RNAzolB™, and RNA is extracted 
by the addition of 1/10 v/v chloroform to the solubilized 
sample followed by vigorous shaking for approximately 15 
seconds. The mixture is then centrifuged for 15 minutes at 
12,000 g and the aqueous phase removed to a fresh tube. 
RNA is then precipitated with isopropanol. The resultant 
RNA pellet is dissolved in water and re-extracted with an 
equal volume of chloroform to remove any remaining phe- 
nol. The extracted volume is precipitated with 2 volumes of 
ethanol in the presence of 150 mM sodium acetate. The 



precipitated RNA is then dissolved in water and the con- 
centration determined spectroscopically (A260). 

[0171] In specific embodiments, diseases and disorders 
involving reduced activation of T cells can be diagnosed, or 
their suspected presence can be screened for, or a predispo- 
sition to develop such disorders can be detected, by detect- 
ing decreased levels of TCAP protein, TCAP RNA, or TCAP 
functional activity (e.g., phosphatase activity, SH3 domain- 
binding activity, GTPase activity, ligand-binding activity, 
transcriptional activation activity, etc.), or by detecting 
mutations in TCAP RNA, DNA or protein (e.g., transloca- 
tions of a TCAP nucleic acid, truncations in a TCAP gene or 
protein, changes in nucleotide or amino acid sequence 
relative to wild-type TCAP) that cause decreased expression 
or activity of TCAP. Such diseases and disorders include but 
are not limited to immune function reduction or failure 
resulting from chemotherapy, HIV infection, septic shock, or 
severe combined immune deficiency. By way of example, 
reduced levels of a particular TCAP, in comparison to a 
normal or control sample, can be detected by immnunoas- 
say; levels of TCAP RNA can be detected by hybridization 
assays (e.g., Northern blots, dot blots); the activity of a 
particular TCAP can be measured using assays known in the 
art; translocations and point mutations in TCAP nucleic 
acids can be detected by Southern blotting, RFLP analysis, 
PCR using primers that preferably generate a fragment 
spanning at least most of a TCAP gene, sequencing of the 
TCAP genomic DNA or cDNA obtained from the patient; 
etc. Where levels of TCAPs, TCAP nucleic acid, or TCAP 
activity are to be measured, in some instances no TCAP, 
TCAP nucleic acid, or TCAP activity can be discerned in a 
sample, as compared to a normal or control sample. In this 
instance, the absence of the TCAP, TCAP nucleic acid or 
TCAP activity indicates the presence of a disease or disorder 
involving the reduced activation of T cells. 

[0172] In one embodiment, levels of TCAP mRNA or 
protein in a patient sample are detected or measured, in 
which decreased levels indicate that the subject has, or has 
a predisposition to developing, a disorder involving under- 
activation of T cells; in which the decreased levels are 
relative to the levels present in an analogous sample from a 
not having such a disorder. 

[0173] In another embodiment, diseases and disorders 
involving undesirable T cell proliferation, or in which T cell 
activation and/or proliferation is desirable for treatment, are 
diagnosed, or their suspected presence can be screened for, 
or a predisposition to develop such disorders can be 
detected, by detecting increased levels of a particular TCAP, 
or the RNA encoding the particular TCAP, or the functional 
activity of a particular TCAP (e.g., phosphatase activity, 
GTPase activity, GTPase activation activity, transcriptional 
activation activity, transducin-like activity, etc.), or by 
detecting mutations in the RNA, DNA or amino acid 
sequence of a particular TCAP (e.g., translocations in TCAP 
nucleic acids, truncations in the gene or protein, changes in 
nucleotide or amino acid sequence relative to wild-type 
TCAP) that cause increased expression or activity of a 
particular TCAP. Such diseases and disorders include but are 
not limited to graft-versus-host disease, allergic reactions, 
undesirable reactions to vaccinations, or autoimmune disor- 
ders in which the immune system recognizes a component of 
the body. By way of example, levels of TCAP protein, levels 
of TCAP RNA, TCAP kinase activity, TCAP binding activ- 
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ity, and the presence of translocations or point mutations can 
be determined as described above. 

[0174] In another embodiment, levels of TCAP nucleic 
acid or protein in a patient sample are detected or measured, 
in which increased levels indicate that the subject has, or has 
a predisposition to developing, a T cell activation disorder in 
which the increased levels are relative to the levels present 
in an analogous sample from a portion of the body or from 
a subject not having the disorder. 

[0175] Kits for diagnostic use are also provided that 
comprise in one or more containers an anti-TCAP antibody, 
and, optionally, a labeled binding partner to the antibody. 
Alternatively, the anti-TCAP antibody can be labeled with a 
detectable-moiety, e.g., a chemiluminescent, enzymatic, 
fluorescent, or radioactive moiety. A kit is also provided that 
comprises in one or more containers a nucleic acid probe 
capable of hybridizing to TCAP RNA. In a specific embodi- 
ment, a kit can comprise in one or more containers a pair of 
primers (e.g., each in the size range of 6-30 nucleotides) that 
are capable of priming amplification, e.g., by polymerase 
chain reaction (PCR; Innis et al., PCR Protocols, Academic 
Press, Inc., San Diego, Calif. (1990)), ligase chain reaction 
(EP 320,308) use of Q replicase, cyclic probe reaction, or 
other methods known in the art, under appropriate reaction 
conditions of at least a portion of a TCAP-encoding nucleic 
acid. A kit can optionally further comprise in a container a 
predetermined amount of a purified TCAP protein or nucleic 
acid, e.g., for use as a standard or control, and/or a container 
comprising a buffer in which PCR or another amplification 
reaction can be conducted, and/or a container comprising an 
enzyme (e.g., a polymerase) suitable for use in the ampli- 
fication reaction. 

5.7.3. THERAPEUTIC USES 

5.7.3.1. GENE THERAPY 

[0176] The invention also provides for treatment of vari- 
ous diseases and disorders by administration of a therapeutic 
compound (termed herein "Therapeutic"). Such "Therapeu- 
tics" include, but are not limited to: TCAPs and derivatives 
(including fragments) thereof (e.g., as described herein 
above); antibodies thereto (as described herein above); 
nucleic acids encoding the particular TCAP(s) or TCAP 
derivatives (e.g., as described herein above); antisense 
nucleic acids to nucleic acids encoding a particular TCAP, 
and agonists and antagonists. Disorders involving under- 
activation of T cells are treated by administration of a 
Therapeutic that promotes the function of a particular TCAP 
or set of TCAPs. Where T cell activity is sought to be 
reduced, e.g., in immunosuppressive therapy, reduction is 
accomplished by administration of a Therapeutic that 
antagonizes (inhibits) the function of a TCAP or set of 
TCAPs. The above is described in detail in the subsections 
below. 

[0177] Generally, administration of products of a species 
origin or species reactivity (in the case of antibodies) that is 
the same species as that of the patient is preferred. Thus, in 
a preferred embodiment, a human TCAP, derivative, or 
nucleic acid, or an antibody to a human TCAP, is therapeu- 
tically or prophylactically administered to a human patient. 

[0178] In a specific embodiment, the invention further 
provides a method of treating or preventing a disease or 



disorder involving undesirable T cell activation in a subject 
comprising administering to a subject in which such treat- 
ment is desired a therapeutically effective amount of a 
molecule that inhibits the function of at least one TCAP. In 
a more specific embodiment, the subject is a human. In a 
more specific embodiment, the invention provides the 
method above, wherein the molecule that inhibits TCAP 
function (i.e., the therapeutic) is selected from the group 
consisting of a TCAP derivative that is active in inhibiting 
cell proliferation, a nucleic acid encoding a TCAP, a nucleic 
acid encoding a TCAP derivative that is active in inhibiting 
cell proliferation, an anti-TCAP antibody or a fragment or 
derivative thereof containing the binding region thereof, a 
nucleic acid complementary to the RNA produced by tran- 
scription of a TCAP gene, and a nucleic acid comprising at 
least a portion of a TCAP gene into which a heterologous 
nucleotide sequence has been inserted such that said heter- 
ologous sequence inactivates the biological activity of the at 
least a portion of the TCAP gene, in which the TCAP gene 
portion flanks the heterologous sequence so as to promote 
homologous recombination with a genomic TCAP gene. In 
a further, more specific embodiment of the method above, 
the therapeutic that inhibits TCAP function is an oligonucle- 
otide that (a) consists of at least six nucleotides; (b) com- 
prises a sequence complementary to at least a portion of an 
RNA transcript of a TCAP gene; and (c) is hybridizable to 
the RNA transcript under moderately stringent conditions. In 
yet another specific embodiment of the above method, the 
molecule that inhibits TCAP function is a protein having at 
least 60% identity to a domain of a TCAP. 

[0179] The invention further provides a method of treating 
a disease or disorder involving a deficiency in cell prolif- 
eration or in which cell proliferation is desirable for treat- 
ment in a subject comprising administering to a subject in 
which such treatment is desired a therapeutically effective 
amount of a molecule that promotes TCAP function. 

[0180] In a specific embodiment, nucleic acids comprising 
a sequence encoding a TCAP or functional derivative 
thereof, are administered to promote TCAP function, by way 
of gene therapy. Gene therapy refers to therapy performed 
by the administration of a nucleic acid to a subject. In this 
embodiment of the invention, the nucleic acid produces its 
encoded protein that mediates a therapeutic effect by pro- 
moting TCAP function. 

[0181] Any of the methods for gene therapy available in 
the art can be used according to the present invention. 
Exemplary methods are described below. 

[0182] For general reviews of the methods of gene 
therapy, see Goldspiel et al., Clinical Pharmacy 12:488-505 
(1993); Wu and Wu, Biotherapy 3:87-95 (1991); Tolstoshev, 
Ann. Rev. Pharmacol. Toxicol. 32:573-596 (1993); Mulli- 
gan, Science 260:926-932 (1993); and Morgan and Ander- 
son, Ann. Rev. Biochem. 62:191-217 (1993); May, 
TIBTECH 11(5):155-215 (1993)). Methods commonly 
known in the art of recombinant DNA technology which can 
be used are described in Ausubel et al. (eds.), Current 
Protocols in Molecular Biology, John Wiley C Sons, NY 
(1993); and Kriegler, Gene Transfer and Expression, A 
Laboratory Manual, Stockton Press, NY (1990). 

[0183] In a preferred aspect, the Therapeutic comprises a 
TCAP-encoding nucleic acid that is part of an expression 
vector that expresses a TCAP protein or fragment or chi- 



US 2003/01 19024 Al 



22 



Jun. 26, 2003 



meric protein thereof in a suitable host. In particular, such a 
nucleic acid has a promoter operably linked to the TCAP 
gene coding region, said promoter being inducible or con- 
stitutive, and, optionally, tissue -specific. In another particu- 
lar embodiment, a nucleic acid molecule is used in which the 
TCAP coding sequences and any other desired sequences are 
flanked by regions that promote homologous recombination 
at a desired site in the genome, thus providing for intrach- 
romosomal expression of the TCAP nucleic acid (Koller and 
Smithies, Proc. Natl. Acad. Sci. U.S.A. 86:8932-8935 
(1989); Zijlstra et al., Nature 342:435-438 (1989)). 

[0184] Delivery of the nucleic acid into a patient may be 
either direct, in which case the patient is directly exposed to 
the nucleic acid or nucleic acid-carrying vector, or indirect, 
in which case, cells are first transformed with the nucleic 
acid in vitro, then transplanted into the patient. These two 
approaches are known, respectively, as in vivo or ex vivo 
gene therapy. 

[0185] In a specific embodiment, the nucleic acid is 
directly administered in vivo, where it is expressed to 
produce the encoded product. This can be accomplished by 
any of numerous methods known in the art, e.g., by con- 
structing it as part of an appropriate nucleic acid expression 
vector and administering it so that it becomes intracellular, 
e.g., by infection using a defective or attenuated retroviral or 
other viral vector (see U.S. Pat. No. 4,980,286), or by direct 
injection of naked DNA, or by use of microp article bom- 
bardment (e.g., a gene gun; Biolistic, DuPont), or coating 
with lipids or cell-surface receptors or transfecting agents, 
encapsulation in liposomes, microp articles, or microcap- 
sules, or by administering it in linkage to a peptide which is 
known to enter the nucleus, by administering it in linkage to 
a ligand subject to receptor-mediated endocytosis (see e.g., 
Wu and Wu, Biol. Chem. 262:4429-4432 (1987)) (which can 
be used to target cell types specifically expressing the 
receptors), etc. In another embodiment, a nucleic acid-ligand 
complex can be formed in which the ligand comprises a 
fusogenic viral peptide to disrupt endosomes, allowing the 
nucleic acid to avoid lysosomal degradation. In yet another 
embodiment, the nucleic acid can be targeted in vivo for cell 
specific uptake and expression, by targeting a specific recep- 
tor (see, e.g., PCT Publications WO 92/06180 published 
Apr. 16, 25, 1992 (Wu et al.); WO 92/22635 published Dec. 
23, 1992 (Wilson et al.); WO92/20316 published Nov. 26, 
1992 (Findeis et al.); W093/14188 published Jul. 22, 1993 
(Clarke et al.), WO 93/20221 published Oct. 14, 1993 
(Young)). Alternatively, the nucleic acid can be introduced 
intracellularly and incorporated within host cell DNA for 
expression, by homologous recombination (Koller and 
Smithies, Proc. Natl. Acad. Sci. U.S.A. 86:8932-8935 
(1989); Zijlstra et al., Nature 342:435-438 (1989)). 

[0186] In a specific embodiment, a viral vector that con- 
tains the TCAP-encoding nucleic acid is used. For example, 
a retroviral vector can be used (see Miller et al., Meth. 
Enzymol. 217:581-599 (1993)). These retroviral vectors 
have been modified to delete retroviral sequences that are 
not necessary for packaging of the viral genome and inte- 
gration into host cell DNA. The TCAP-encoding nucleic 
acid to be used in gene therapy is cloned into the vector, 
which facilitates delivery of the gene into a patient. More 
detail about retroviral vectors can be found in Boesen et al., 
Biotherapy 6:291-302 (1994), which describes the use of a 
retroviral vector to deliver the mdrl gene to hematopoietic 



stem cells in order to make the stem cells more resistant to 
chemotherapy. Other references illustrating the use of ret- 
roviral vectors in gene therapy are: Clowes et al., J. Clin. 
Invest. 93:644-651 (1994); Kiem et al., Blood 83:1467-1473 
(1994); Salmons and Gunzberg, Human Gene Therapy 
4:129-141 (1993); and Grossman and Wilson, Curr. Opin. in 
Genetics and Devel. 3:110-114 (1993). 

[0187] Adenoviruses are other viral vectors that can be 
used in gene therapy. Adenoviruses are especially attractive 
vehicles for delivering genes to respiratory epithelia. Aden- 
oviruses naturally infect respiratory epithelia where they 
cause a mild disease. Other targets for adeno virus-based 
delivery systems are liver, the central nervous system, 
endothelial cells, and muscle. Adenoviruses have the advan- 
tage of being capable of infecting non-dividing cells. Kozar- 
sky and Wilson, Current Opinion in Genetics and Develop- 
ment 3:499-503 (1993) present a review of adenovirus- 
based gene therapy. Bout et al., Human Gene Therapy 
5:3-10 (1994) demonstrate the use of adenovirus vectors to 
transfer genes to the respiratory epithelia of rhesus monkeys. 
Other instances of the use of adenoviruses in gene therapy 
can be found in Rosenfeld et al., Science 252:431-434 
(1991); Rosenfeld et al., Cell 68:143-155 (1992); and Mas- 
trangeli et al., J. Clin. Invest. 91:225-234 (1993). 

[0188] Adeno -associated virus (AAV) has also been pro- 
posed for use in gene therapy (Walsh et al., Proc. Soc. Exp. 
Biol. Med. 204:289-300 (1993). 

[0189] Another approach to gene therapy involves trans- 
ferring a gene to cells in tissue culture. The expression of the 
transferred gene may be controlled by its native promoter, or 
can be controlled by a non-native promoter (see Section 5.2, 
supra; Section 5.7.3.1, infra). In addition to transferring a 
nucleic acid comprising a nucleic acid sequence encoding an 
entire TCAP (i.e., equivalent to the wild type), the trans- 
ferred nucleic acids can encode a functional portion of a 
particular TCAP, or a protein having at least 60% sequence 
identity to a TCAP disclosed herein, as compared over the 
length of the particular TCAP, or protein (whichever is 
shorter) or a polypeptide having at least 60% sequence 
similarity to a TCAP fragment, as compared over the length 
of the TCAP fragment or polypeptide (whichever is shorter). 
Introduction of the nucleic acid into the cell is accomplished 
by such methods as electroporation, lipofection, calcium 
phosphate mediated transfection, or viral infection. Usually, 
the method of transfer includes the transfer of a selectable 
marker to the cells. The cells are then placed under selection 
to isolate those cells that have taken up and are expressing 
the transferred gene. Those cells are then delivered to a 
patient. 

[0190] In this embodiment, the nucleic acid is introduced 
into a cell prior to administration in vivo of the resulting 
recombinant cell. Such introduction can be carried out by 
any method known in the art, including but not limited to 
transfection, electroporation, microinjection, infection with 
a viral or bacteriophage vector containing the nucleic acid 
sequences, cell fusion, chromosome-mediated gene transfer, 
microcell-mediated gene transfer, spheroplast fusion, etc. 
Numerous techniques are known in the art for the introduc- 
tion of foreign genes into cells (see e.g., Loeffler and Behr, 
Meth. Enzymol. 217:599-618 (1993); Cohen et al., Meth. 
Enzymol. 217:618-644 (1993); Cline, Pharmac. Ther. 29:69- 
92 (1985)) and may be used in accordance with the present 
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invention, provided that the necessary developmental and 
physiological functions of the recipient cells are not dis- 
rupted. The technique should provide for the stable transfer 
of the nucleic acid to the cell, so that the nucleic acid is 
expressible by the cell and preferably heritable and express- 
ible by its cell progeny. 

[0191] The resulting recombinant cells can be delivered to 
a patient by various methods known in the art. In a preferred 
embodiment, epithelial cells are injected, e.g., subcutane- 
ously. In another embodiment, recombinant skin cells may 
be applied as a skin graft onto the patient. Recombinant 
blood cells (e.g., hematopoietic stem or progenitor cells) are 
preferably administered intravenously. The amount of cells 
envisioned for use depends on the desired effect, patient 
state, etc., and can be determined by one skilled in the art. 

[0192] Cells into which a nucleic acid can be introduced 
for purposes of gene therapy encompass any desired, avail- 
able cell type, and include but are not limited to epithelial 
cells, endothelial cells, keratinocytes, fibroblasts, muscle 
cells, hepatocytes; blood cells such as T lymphocytes, B 
lymphocytes, monocytes, macrophages, neutrophils, eosi- 
nophils, megakaryocytes, granulocytes; various stem or pro- 
genitor cells, in particular hematopoietic stem or progenitor 
cells, e.g., as obtained from bone marrow, umbilical cord 
blood, peripheral blood, fetal liver, etc. 

[0193] In a preferred embodiment, the cell used for gene 
therapy is autologous to the patient. 

[0194] In an embodiment in which recombinant cells are 
used in gene therapy, a TCAP-encoding nucleic acid is 
introduced into the cells such that it is expressible by the 
cells or their progeny, and the recombinant cells are then 
administered in vivo for therapeutic effect. In a specific 
embodiment, stem or progenitor cells are used. Any stem 
and/or progenitor cells which can be isolated and maintained 
in vitro can potentially be used in accordance with this 
embodiment of the present invention. Such stem cells 
include but are not limited to hematopoietic stem cells 
(HSC), stem cells of epithelial tissues such as the skin and 
the lining of the gut embryonic heart muscle cells, liver stem 
cells (PCT Publication WO 94/08598, published Apr. 28, 
1994), and neural stem cells (S temple and Anderson, Cell 
71:973-985 (1992)). 

[0195] Epithelial stem cells (ESCs) or keratinocytes can 
be obtained from tissues such as the skin and the lining of 
the gut by known procedures (Rheinwald, Meth. Cell Bio. 
(21A):229 (1980)). In stratified epithelial tissue such as the 
skin, renewal occurs by mitosis of stem cells within the 
germinal layer, the layer closest to the basal lamina. Stem 
cells within the lining of the gut provide for a rapid renewal 
rate of this tissue. ESCs or keratinocytes obtained from the 
skin or lining of the gut of a patient or donor can be grown 
in tissue culture (Rheinwald, Meth. Cell Bio. 21A:229 
(1980); Pittelkow and Scott, Mayo Clinic Proc. 61:771 
(1986)). If the ESCs are provided by a donor, a method for 
suppression of host versus graft reactivity (e.g., irradiation, 
drug or antibody administration to promote moderate immu- 
nosuppression) can also be used. 

[0196] With respect to hematopoietic stem cells (HSC), 
any technique which provides for the isolation, propagation, 
and maintenance in vitro of HSC can be used in this 
embodiment of the invention. Techniques by which this may 



be accomplished include (a) the isolation and establishment 
of HSC cultures from bone marrow cells isolated from the 
future host, or a donor, or (b) the use of previously estab- 
lished long-term HSC cultures, which may be allogeneic or 
xenogeneic. Non-autologous HSC are used preferably in 
conjunction with a method of suppressing transplantation 
immune reactions of the future host/patient. In a particular 
embodiment of the present invention, human bone marrow 
cells can be obtained from the posterior iliac crest by needle 
aspiration (see, e.g., Kodo et al., J. Clin. Invest. 73:1377- 
1384 (1984)). In a preferred embodiment of the present 
invention, the HSCs can be made highly enriched or in 
substantially pure form. This enrichment can be accom- 
plished before, during, or after long-term culturing, and can 
be done by any techniques known in the art. Long-term 
cultures of bone marrow cells can be established and main- 
tained by using, for example, modified Dexter cell culture 
techniques (Dexter et al., J. Cell Physiol. 91:335 (1977)) or 
Witlock-Witte culture techniques (Witlock and Witte, Proc. 
Natl. Acad. Sci. U.S.A. 79:3608-3612 (1982)). 

[0197] In a specific embodiment, the nucleic acid to be 
introduced for purposes of gene therapy comprises an induc- 
ible promoter operably linked to the coding region, such that 
expression of the nucleic acid is controllable by controlling 
the presence or absence of the appropriate inducer of tran- 
scription. 

5.7.3.1.1. ANTISENSE REGULATION OF 
EXPRESSION OF TCAP GENES 

[0198] In a specific embodiment, the function of a par- 
ticular TCAP is inhibited by use of antisense nucleic acids 
substantially complementary to the transcript from a TCAP- 
encoding gene. The present invention provides the thera- 
peutic or prophylactic use of nucleic acids of at least six 
nucleotides that are antisense to a gene or cDNA encoding 
TCAP or a portion thereof. A "TCAP antisense nucleic acid" 
as used herein refers to a nucleic acid that of hybridizes to 
a sequence-specific nucleic acid (preferably mRNA) seg- 
ment (i.e., not the poly -A tract of an mRNA) that encodes 
TA-GPCR, TA-GAP, TA-NFKBH, TA-KRP, TA-PP2C, TA- 
WDRP or TA-LRRP by virtue of some sequence comple- 
mentarity. The antisense nucleic acid may be complemen- 
tary to a coding and/or noncoding region of an mRNA 
encoding these TCAPs. Such antisense nucleic acids have 
utility as Therapeutics that inhibits TCAP function, and can 
be used in the treatment of disorders that result from T cell 
activation. 

[0199] The antisense nucleic acids of the invention can be 
oligonucleotides that are double-stranded or single -stranded, 
RNA or DNA or a modification or derivative thereof, which 
can be directly administered to a cell, or which can be 
produced intracellularly by transcription of exogenous, 
introduced sequences. 

[0200] The invention further provides pharmaceutical 
compositions comprising an effective amount of the TCAP 
antisense nucleic acids of the invention in a pharmaceuti- 
cally acceptable carrier, as described infra. 

[0201] In another embodiment, the invention is directed to 
methods for inhibiting the expression of a TCAP-encoding 
nucleic acid sequence in a prokaryotic or eukaryotic cell 
comprising providing the cell with an effective amount of a 
composition comprising a TCAP antisense nucleic acid of 
the invention. 
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[0202] TCAP antisense nucleic acids and their uses are 
described in detail below. 

5.7.3.1.2. TCAP ANTISENSE NUCLEIC ACIDS 

[0203] The TCAP antisense nucleic acids of the present 
invention are of at least six nucleotides and are preferably 
oligonucleotides (typically ranging from 6 to about 50 
oligonucleotides). In specific aspects, the oligonucleotide is 
at least 10 nucleotides, at least 15 nucleotides, at least 100 
nucleotides, or at least 200 nucleotides. The oligonucle- 
otides can be DNA or RNA or chimeric mixtures or deriva- 
tives or modified versions thereof, single -stranded or 
double-stranded. The oligonucleotide can be modified at the 
base moiety, sugar moiety, or phosphate backbone. The 
oligonucleotide may include other appending groups such as 
peptides, or agents facilitating transport across the cell 
membrane (see, e.g., Letsinger et al., Proc. Natl. Acad. Sci. 
U.S.A. 86:6553-6556 (1989); Lemaitre et al., Proc. Natl. 
Acad. Sci. U.S.A. 84:648-652 (1987); PCT Publication No. 
WO 88/09810, published Dec. 15, 1988) or blood-brain 
barrier (see, e.g., PCT Publication No. WO 89/10134, pub- 
lished Apr. 25, 1988), hybridization-triggered cleavage 
agents (see, e.g., Krol et al.. Bio Techniques 6:958-976 
(1988)) or intercalating agents (see, e.g., Zon, Pharm. Res. 
5:539-549 (1988)). 

[0204] In a preferred aspect of the invention, a TCAP 
antisense oligonucleotide is provided, preferably of single- 
stranded DNA. In a most preferred aspect, such an oligo- 
nucleotide comprises a sequence antisense to the sequence 
encoding one or more domains of a TCAP protein, most 
preferably, of a human TCAP protein. The oligonucleotide 
may be modified at any position on its structure with 
substituents generally known in the art. 

[0205] The TCAP antisense oligonucleotide may comprise 
at least one modified base moiety which is selected from the 
group including but not limited to 5-fluorouracil, 5-bromou- 
racil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xantine, 

4- acetylcytosine, 5-(carboxyhydroxylmethyl) uracil, 5-car- 
boxymethylaminomethyl-2-thiouridine, 5-carboxymethy- 
laminomethyluracil, dihydrouracil, beta-D-galactosylque- 
osine, inosine, N6-isopentenyladenine, 1-methylguanine, 

1- methylinosine, 2,2-dimethylguanine, 2-methyladenine, 

2- methylguanine, 3-methylcytosine, 5 -me thy Icy to sine, 
N6-adenine, 7-methylguanine, 5-methylaminomethyluracil, 

5- methoxyaminomethyl-2-thiouracil, 5 beta-D-manno- 
sylqueosine, 5'-methoxycarboxymethyluracil, 5-methox- 
yuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxy- 
acetic acid (v), wybutoxosine, pseudouracil, queosine, 
2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiou- 
racil, 5-methyluracil, uracil-5-oxy acetic acid methylester, 
uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3- 
amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-di- 
aminopurine. 

[0206] In another embodiment, the oligonucleotide com- 
prises at least one modified sugar moiety selected from the 
group including but not limited to arabinose, 2-fluoroarabi- 
nose, xylulose, and hexose. 

[0207] In yet another embodiment, the oligonucleotide 
comprises at least one modified phosphate backbone 
selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphora- 



midate, a thiophosphoamidate, a phosphordiamidate, a 
methylphosphonate, an alkyl phosphotriester, and a formac- 
etal or analog thereof. 

[0208] In yet another embodiment, the oligonucleotide is 
an a-anomeric oligonucleotide. An a-anomeric oligonucle- 
otide forms specific double -stranded hybrids with comple- 
mentary RNA in which, contrary to the usual (3-units, the 
strands run parallel to each other (Gautier et al., Nucl. Acids 
Res. 15:6625-6641 (1987)). 

[0209] The oligonucleotide may be conjugated to another 
molecule, e.g., a peptide, hybridization triggered cross- 
linking agent, transport agent, hybridization-triggered cleav- 
age agent, etc. 

[0210] Oligonucleotides of the invention may be synthe- 
sized by standard methods known in the art, e.g. by use of 
an automated DNA synthesizer (such as are commercially 
available from Biosearch, Applied Biosystems, etc.). As 
examples, phosphorothioate oligonucleotides may be syn- 
thesized by the method of Stein et al. Nucl. Acids Res. 
16:3209 (1988), methylphosphonate oligonucleotides can be 
prepared by use of controlled pore glass polymer supports 
(Sarin et al., Proc. Natl. Acad. Sci. U.S.A. 85:7448-7451 
(1988)), etc. 

[0211] In a specific embodiment, the TCAP antisense 
oligonucleotide comprises catalytic RNA, or a ribozyme 
(see, e.g., PCT International Publication WO 90/11364, 
published Oct. 4, 1990; Sarver et al., Science 247:1222-1225 
(1990)). In another embodiment, the oligonucleotide is a 
2'-0-methylribonucleotide (Inoue et al., Nucl. Acids Res. 
15:6131-6148 (1987)), or a chimeric RNA- DNA analog 
(Inoue et al., FEBS Lett. 215: 327-330 (1987)). 

[0212] In an alternative embodiment, the TCAP antisense 
nucleic acid of the invention is produced intracellularly by 
transcription from an exogenous sequence. For example, a 
vector can be introduced in vivo such that it is taken up by 
a cell, within which cell the vector or a portion thereof 
transcribed, producing an antisense nucleic acid (RNA) of 
the invention. Such a vector would contain a sequence 
encoding the TCAP antisense nucleic acid. Such a vector can 
remain episomal or become chromosomally integrated, as 
long as it can be transcribed to produce the desired antisense 
RNA. Such vectors can be constructed by recombinant DNA 
technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replica- 
tion and expression in mammalian cells. Expression of the 
sequence encoding the TCAP antisense RNA can be by any 
promoter known in the art to act in mammalian, preferably 
human, cells. Such promoters can be inducible or constitu- 
tive. Such promoters include but are not limited to: the SV40 
early promoter region (Bernoist and Chambon, Nature 
290:304-310 (1981)), the promoter contained in the 3' long 
terminal repeat of Rous sarcoma virus (Yamamoto et al., 
Cell 22:787-797 (1980)), the herpes thymidine kinase pro- 
moter (Wagner et al., Proc. Natl. Acad. Sci. U.S.A. 78:1441- 
1445 (1981)), the regulatory sequences of the metallothio- 
nein gene (Brinster et al., Nature 296:39-42 (1982)), etc. 

[0213] The antisense nucleic acids of the invention com- 
prise a sequence complementary to at least a portion of a 
"RNA transcript of a TCAP gene, preferably a human TCAP 
gene. However, absolute complementarity, although pre- 
ferred, is not required. A sequence "complementary to at 
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least a portion of an RNA," as referred to herein, means a 
sequence having sufficient complementarity to be able to 
hybridize with the RNA, forming a stable duplex; in the case 
of double-stranded TCAP antisense nucleic acids, a single 
strand of the duplex DNA may thus be tested, or triplex 
formation may be assayed. The ability to hybridize will 
depend on both the degree of complementarity and the 
length of the antisense nucleic acid. Generally, the longer the 
hybridizing nucleic acid, the more base mismatches with an 
RNA transcribed from a TCAP-encoding gene it may con- 
tain and still form a stable duplex (or triplex, as the case may 
be). One skilled in the art can ascertain a tolerable degree of 
mismatch by use of standard procedures to determine the 
melting point of the hybridized complex. The antisense 
nucleic acids of the present invention hybridize to the target 
nucleic acid under moderately stringent conditions, and 
more preferably hybridize under highly stringent conditions. 

5.7.3.1.3. THERAPEUTIC USE OF ANTISENSE 
NUCLEIC ACIDS TO TCAP-ENCODING GENES 

[0214] Antisense nucleic acids to the TCAP-encoding 
genes of the present invention can be used to treat disorders 
of a cell type that expresses, or preferably overexpresses, the 
particular TCAP to which the antisense nucleic acid is 
directed. In a specific embodiment, such a disorder is a 
hyperactivation of the immune system mediated by T cells. 
In more specific embodiment, such a disorder is an immune 
system disorder that results in, or is attributable to, the 
overexpression of TA-GPCR, TA-GAP, TA-NFKBH, TA- 
KRP, TA-PP2C, TA-WDRP or TA-LRRP. In a preferred 
embodiment, a single-stranded DNA antisense TCAP oligo- 
nucleotide is used. 

[0215] Cell types which express or overexpress TCAP 
RNA can be identified by various methods known in the art. 
Such methods include but are not limited to hybridization 
with a TCAP-specific nucleic acid (e.g. by Northern hybrid- 
ization, dot blot hybridization, in situ hybridization), observ- 
ing the ability of RNA from the cell type to be translated in 
vitro into qTCAP, immunoassay, etc. In a preferred aspect, 
primary tissue from a patient can be assayed for expression 
one or more TCAP prior to treatment, e.g., by immunocy- 
to chemistry or in situ hybridization. 

[0216] Pharmaceutical compositions of the invention (see 
Section 5.7.3.3), comprising an effective amount of a TCAP 
antisense nucleic acid in a pharmaceutically acceptable 
carrier, can be administered to a patient having a disease or 
disorder which is of a type that expresses or overexpresses 
a TCAP or TCAP RNA. 

[0217] The amount of TCAP antisense nucleic acid which 
will be effective in the treatment of a particular disorder or 
condition will depend on the nature of the disorder or 
condition, and can be determined by standard clinical tech- 
niques. Where possible, it is desirable to determine the 
antisense cytotoxicity of the tumor type to be treated in vitro, 
and then in useful animal model systems prior to testing and 
use in humans. 

[0218] In a specific embodiment, pharmaceutical compo- 
sitions comprising TCAP antisense nucleic acids are admin- 
istered via liposomes, micr op articles, or microcapsules. In 
various embodiments of the invention, it may be useful to 
use such compositions to achieve sustained release of the 
TCAP antisense nucleic acids. In a specific embodiment, it 



may be desirable to utilize liposomes targeted via antibodies 
to specific identifiable tumor antigens (Leone tti et al., 1990, 
Proc. Natl. Acad. Sci. U.S.A. 87:2448-2451 (1990); Ren- 
neisen et al., J. Biol. Chem. 265:16337-16342 (1990)). 

5.7.3.2. DEMONSTRATION OF THERAPEUTIC 
OR PROPHYLACTIC UTILITY 

[0219] The Therapeutics of the invention are preferably 
tested in vitro, and then in vivo for the desired therapeutic 
or prophylactic activity, prior to use in humans. 

[0220] For example, in vitro assays which can be used to 
determine whether administration of a specific Therapeutic 
is indicated, include in vitro cell culture assays in which a 
patient tissue sample is grown in culture, and exposed to or 
otherwise administered a Therapeutic, and the effect of such 
Therapeutic upon the tissue sample is observed. In one 
embodiment, a Therapeutic that reverses or reduces the 
activation of T cells is selected for therapeutic use in vivo. 
Many assays standard in the art can be used to assess such 
survival and/or growth; for example, cell proliferation can 
be assayed by measuring 3H-thymidine incorporation, by 
direct cell count, by detecting changes in transcriptional 
activity of known genes such as proto-oncogenes (e.g., fos, 
myc) or cell cycle markers; cell viability can be assessed by 
trypan blue staining, differentiation can be assessed visually 
based on changes in morphology, etc. 

[0221] In another embodiment, a Therapeutic is indicated 
for use which exhibits the desired effect, inhibition or 
promotion of T cell activation, upon a patient sample where 
the patient suffers a condition associated with T cell activa- 
tion. 

[0222] In various specific embodiments, in vitro assays 
can be carried out with a patient's T cells, to determine if a 
Therapeutic has a desired effect upon such cells. 

[0223] In another embodiment, T cells capable of being 
activated are plated out or grown in vitro, and exposed to a 
Therapeutic. The Therapeutic which results in a cell pheno- 
type that is more normal (i.e., less representative of a 
pre -neoplastic state, neoplastic state, malignant state, or 
transformed phenotype) is selected for therapeutic use. 
Many assays standard in the art can be used to assess 
whether a pre -neoplastic state, neoplastic state, or a trans- 
formed or malignant phenotype, is present. For example, 
characteristics associated with a transformed phenotype (a 
set of in vitro characteristics associated with a tumorigenic 
ability in vivo) include a more rounded cell morphology, 
looser substratum attachment, loss of contact inhibition, loss 
of anchorage dependence, release of proteases such as 
plasminogen activator, increased sugar transport, decreased 
serum requirement, expression of fetal antigens, disappear- 
ance of the 250,000 dalton surface protein, etc. (see Luria et 
al., GENERAL VIROLOGY, 3D ED., JOHN WILEY & 
SONS, New York pp. 436-446 (1978)). 

[0224] In other specific embodiments, the in vitro assays 
described supra can be carried out using a cell line, rather 
than a cell sample derived from the specific patient to be 
treated, in which the cell line is derived from or displays 
characteristic(s) associated with the malignant, neoplastic or 
pre -neoplastic disorder desired to be treated or prevented, or 
is derived from the cell type upon which an effect is desired, 
according to the present invention. 
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[0225] Compounds for use in therapy can be tested in 
suitable animal model systems prior to testing in humans, 
including but not limited to rats, mice, chicken, cows, 
monkeys, rabbits, etc. For in vivo testing, prior to adminis- 
tration to humans, any animal model system known in the art 
may be used. 

5.7.3.3. THERAPEUTIC/PROPHYLACTIC 
ADMINISTRATION AND COMPOSITIONS 

[0226] The invention provides methods of treatment (and 
prophylaxis) by administration to a subject of an effective 
amount of a Therapeutic of the invention. In a preferred 
aspect, the Therapeutic is substantially purified. The subject 
is preferably an animal, including but not limited to animals 
such as cows, pigs, horses, chickens, cats, dogs, etc., and is 
preferably a mammal, and most preferably human. In a 
specific embodiment, a non-human mammal is the subject. 

[0227] Formulations and methods of administration that 
can be employed when the Therapeutic comprises a nucleic 
acid are described in Section 5.7.1 above; additional appro- 
priate formulations and routes of administration can be 
selected from among those described herein below. 

[0228] Various delivery systems are known and can be 
used to administer a Therapeutic of the invention, e.g., 
encapsulation in liposomes, microp articles, microcapsules, 
recombinant cells capable of expressing the Therapeutic, 
receptor-mediated endocytosis (see, e.g., Wu and Wu, J. 
Biol. Chem. 262:4429-4432 (1987)), construction of a 
Therapeutic nucleic acid as part of a retroviral or other 
vector, etc. Methods of introduction include but are not 
limited to intradermal, intramuscular, intraperitoneal, intra- 
venous, subcutaneous, intranasal, epidural, and oral routes. 
The compounds may be administered by any convenient 
route, for example by infusion or bolus injection, by absorp- 
tion through epithelial or mucocutaneous linings (e.g., oral 
mucosa, rectal and intestinal mucosa, etc.) and may be 
administered together with other biologically active agents. 
Administration can be systemic or local. In addition, it may 
be desirable to introduce the pharmaceutical compositions of 
the invention into the central nervous system by any suitable 
route, including intraventricular and intrathecal injection; 
intraventricular injection may be facilitated by an intraven- 
tricular catheter, for example, attached to a reservoir, such as 
an Ommaya reservoir. Pulmonary administration can also be 
employed, e.g., by use of an inhaler or nebulizer, and 
formulation with an aerosolizing agent. 

[0229] In a specific embodiment, it may be desirable to 
administer the pharmaceutical compositions of the invention 
locally to the area in need of treatment; this may be achieved 
by, for example, and not by way of limitation, local infusion 
during surgery, topical application, e.g., in conjunction with 
a wound dressing after surgery, by injection, by means of a 
catheter, by means of a suppository, or by means of an 
implant, said implant being of a porous, non-porous, or 
gelatinous material, including membranes, such as sialastic 
membranes, or fibers. In one embodiment, administration 
can be by direct injection at the site (or former site) of a 
malignant tumor or neoplastic or pre-neoplastic tissue. 

[0230] In another embodiment, the Therapeutic can be 
delivered in a vesicle, in particular a liposome (see Langer, 
Science 249:1527-1533 (1990); Treat et al., in Liposomes in 
the Therapy of Infectious Disease and Cancer, Lopez-Ber- 



estein and Fidler (eds.), Liss, New York, pp. 353-365 (1989); 
Lopez-Berestein, ibid., pp. 317-327; see generally ibid.) 

[0231] In yet another embodiment, the Therapeutic can be 
delivered in a controlled release system. In one embodiment, 
a pump may be used (see Langer, supra; Sefton, CRC Crit. 
Ref. Biomed. Eng. 14:201 (1987); Buchwald et al., Surgery 
88:507 (1980); Saudek et al., N. Engl. J. Med. 321:574 
(1989)). In another embodiment, polymeric materials can be 
used (see Medical Applications of Controlled Release, 
Langer and Wise (eds.), CRC Pres., Boca Raton, Fla. (1974); 
Controlled Drug Bioavailability, Drug Product Design and 
Performance, Smolen and Ball (eds.), Wiley, N.Y (1984); 
Ranger and Pewas I J. Macromol. Sci. Rev. Macromol. 
Chem. 23:61 (1983); see also Levy et al., Science 228:190 
(1985); During et al., Ann. Neurol. 25:351 (1989); Howard 
et al., J. Neurosurg. 71:105 (1989)). In yet another embodi- 
ment, a controlled release system can be placed in proximity 
of the therapeutic target, i.e., the thymus, thus requiring only 
a fraction of the systemic dose (see, e.g., Goodson, in 
Medical Applications of Controlled Release, supra, vol. 2, 
pp. 115-138 (1984)). 

[0232] Other controlled release systems are discussed in 
the review by Langer (Science 249:1527-1533 (1990)). 

[0233] In a specific embodiment where the Therapeutic is 
a nucleic acid encoding a protein Therapeutic, the nucleic 
acid can be administered in vivo to promote expression of its 
encoded protein, by constructing it as part of an appropriate 
nucleic acid expression vector and administering it so that it 
becomes intracellular, e.g., by use of a retroviral vector (see 
U.S. Pat. No. 4,980,286), or by direct injection, or by use of 
microparticle bombardment (e.g., a gene gun; Biolistic, 
DuPont), or coating with lipids or cell-surface receptors or 
transfecting agents, or by administering it in linkage to a 
homeobox-like peptide which is known to enter the nucleus 
(see e.g., Joliot et al., Proc. Natl. Acad. Sci. U.S.A. 88:1864- 
1868 (1991)), etc. Alternatively, a nucleic acid Therapeutic 
can be introduced intracellularly and incorporated within 
host cell DNA for expression, by homologous recombina- 
tion. 

[0234] The present invention also provides pharmaceuti- 
cal compositions. Such compositions comprise a therapeu- 
tically effective amount of a Therapeutic, and a pharmaceu- 
tically acceptable carrier. In a specific embodiment, the term 
"pharmaceutically acceptable" means approved by a regu- 
latory agency of the Federal or a state government or listed 
in the U.S. Pharmacopeia or other generally recognized 
pharmacopeia for use in animals, and more particularly in 
humans. The term "carrier" refers to a diluent, adjuvant, 
excipient, or vehicle with which the therapeutic is admin- 
istered. Such pharmaceutical carriers can be sterile liquids, 
such as water and oils, including those of petroleum, animal, 
vegetable or synthetic origin, such as peanut oil, soybean oil, 
mineral oil, sesame oil and the like. Water is a preferred 
carrier when the pharmaceutical composition is adminis- 
tered intravenously. Saline solutions and aqueous dextrose 
and glycerol solutions can also be employed as liquid 
carriers, particularly for injectable solutions. Suitable phar- 
maceutical excipients include starch, glucose, lactose, 
sucrose, gelatin, malt, rice, flour, chalk, silica gel, sodium 
stearate, glycerol monostearate, talc, sodium chloride, dried 
skim milk, glycerol, propylene, glycol, water, ethanol and 
the like. The composition, if desired, can also contain minor 
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amounts of wetting or emulsifying agents, or pH buffering 
agents. These compositions can take the form of solutions, 
suspensions, emulsion, tablets, pills, capsules, powders, 
sustained-release formulations and the like. The composi- 
tion can be formulated as a suppository, with traditional 
binders and carriers such as triglycerides. Oral formulation 
can include standard carriers such as pharmaceutical grades 
of mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, etc. Examples 
of suitable pharmaceutical carriers are described in "Rem- 
ington's Pharmaceutical Sciences" by E. W. Martin. Such 
compositions will contain a therapeutically effective amount 
of the Therapeutic, preferably in purified form, together with 
a suitable amount of carrier so as to provide the form for 
proper administration to the patient. The formulation should 
suit the mode of administration. 

[0235] In a preferred embodiment, the composition is 
formulated in accordance with routine procedures as a 
pharmaceutical composition adapted for intravenous admin- 
istration to human beings. Typically, compositions for intra- 
venous administration are solutions in sterile isotonic aque- 
ous buffer. Where necessary, the composition may also 
include a solubilizing agent and a local anesthetic such as 
lignocaine to ease pain at the site of the injection. Generally, 
the ingredients are supplied either separately or mixed 
together in unit dosage form, for example, as a dry lyo- 
philized powder or water free concentrate in a hermetically 
sealed container such as an ampoule or sachette indicating 
the quantity of active agent. Where the composition is to be 
administered by infusion, it can be dispensed with an 
infusion bottle containing sterile pharmaceutical grade water 
or saline. Where the composition is administered by injec- 
tion, an ampoule of sterile water for injection or saline can 
be provided so that the ingredients may be mixed prior to 
administration. 

[0236] The Therapeutics of the invention can be formu- 
lated as neutral or salt forms. Pharmaceutically acceptable 
salts include those formed with free amino groups such as 
those derived from hydrochloric, phosphoric, acetic, oxalic, 
tartaric acids, etc., and those formed with free carboxyl 
groups such as those derived from sodium, potassium, 
ammonium, calcium, ferric hydroxides, isopropylamine, tri- 
ethylamine, 2-ethylamino ethanol, histidine, procaine, etc. 

[0237] The amount of the Therapeutic of the invention 
which will be effective in the treatment of a particular 
disorder or condition will depend on the nature of the 
disorder or condition, and can be determined by standard 
clinical techniques. In addition, in vitro assays may option- 
ally be employed to help identify optimal dosage ranges. 
The precise dose to be employed in the formulation will also 
depend on the route of administration, and the seriousness of 
the disease or disorder, and should be decided according to 
the judgment of the practitioner and each patient's circum- 
stances. However, suitable dosage ranges for intravenous 
administration are generally about 20-500 micrograms of 
active compound per kilogram body weight. Suitable dosage 
ranges for intranasal administration are generally about 0.01 
pg/kg body weight to 1 mg/kg body weight. Effective doses 
may be extrapolated from dose-response curves derived 
from in vitro or animal model test systems. 

[0238] Suppositories generally contain active ingredient in 
the range of 0.5% to 10% by weight; oral formulations 
preferably contain 10 % to 95% active ingredient. 



[0239] The invention also provides a pharmaceutical pack 
or kit comprising one or more containers filled with one or 
more of the ingredients of the pharmaceutical compositions 
of the invention. In one embodiment, the kit provides a 
container having a therapeutically-active amount of a TCAP. 
Optionally associated with such container(s) can be a notice 
in the form prescribed by a governmental agency regulating 
the manufacture, use or sale of pharmaceuticals or biological 
products, which notice reflects approval by the agency of 
manufacture, use or sale for human administration. 

5.7.4. SCREENING FOR TCAP AGONISTS AND 
ANTAGONISTS 

[0240] TCAP nucleic acids, proteins, and derivatives also 
have uses in screening assays to detect molecules that 
specifically bind to TCAP nucleic acids, proteins, or deriva- 
tives and thus have potential use as agonists or antagonists 
of TCAP, in particular, molecules that thus affect T cell 
activation and/or proliferation. In a preferred embodiment, 
such assays are performed to screen for molecules with 
potential utility as anti-cancer drugs or lead compounds for 
drug development. The invention thus provides assays to 
detect molecules that specifically bind to TCAP nucleic 
acids, proteins, or derivatives. For example, recombinant 
cells expressing TCAP nucleic acids can be used to recom- 
binantly produce TCAPs in these assays, to screen for 
molecules that bind to a TCAP. Molecules (e.g., putative 
binding partners of TCAP) are contacted with a particular 
TCAP or fragment thereof under conditions conducive to 
binding, and then molecules that specifically bind to the 
TCAP are identified. Similar methods can be used to screen 
for molecules that bind to TCAP derivatives or nucleic acids. 
Methods that can be used to carry out the foregoing are 
commonly known in the art. 

[0241] By way of example, diversity libraries, such as 
random or combinatorial peptide or nonpeptide libraries can 
be screened for molecules that specifically bind to a par- 
ticular TCAP. Many libraries are known in the art that can 
be used, e.g., chemically synthesized libraries, recombinant 
(e.g., phage display libraries), and in vitro translation-based 
libraries. 

[0242] Examples of chemically synthesized libraries are 
described in Fodor et al., Science 251:767-773 (1991); 
Houghten et al., Nature 354:84-86 (1991); Lam et al., Nature 
354:82-84 (1991); Medynski, Bio/Technology 12:709-710 
(1994); Gallop et al., J. Medicinal Chemistry 37(9): 1233- 
1251 (1994); Ohlmeyer et al., Proc. Natl. Acad. Sci. U.S.A. 
90:10922-10926 (1993); Erb et al., Proc. Natl. Acad. Sci. 
U.S.A. 91:11422-11426 (1994); Houghten et al., Biotech- 
niques 13:412 (1992); Jayawickreme et al., Proc. Natl. Acad. 
Sci. U.S.A. 91:1614-1618 (1994); Salmon et al., Proc. Natl. 
Acad. Sci. U.S.A. 90:11708-11712 (1993); PCT Publication 
No. WO 93/20242; and Brenner and Lerner, Proc. Natl. 
Acad. Sci. U.S.A. 89:5381-5383 (1992). 

[0243] Examples of phage display libraries are described 
in Scott and Smith, Science 249:386-390 (1990); Devlin et 
al., Science, 249:404-406 (1990); Christian, R. B., et al., J. 
Mol. Biol. 227:711-718 (1992)); Lenstra, J. Immunol. Meth. 
152:149-157 (1992); Kay et al., Gene 128:59-65 (1993); and 
PCT Publication No. WO 94/18318 published Aug. 18, 
1994. 

[0244] In vitro translation-based libraries include but are 
not limited to those described in PCT Publication No. WO 
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91/05058 published Apr.18, 1991; and Mattheakis et al., 
Proc. Natl. Acad. Sci. U.S.A. 91:9022-9026 (1994). 

[0245] By way of examples of nonpeptide libraries, a 
benzodiazepine library (see e.g., Bunin et al., Proc. Natl. 
Acad. Sci. U.S.A. 91:4708-4712 (1994)) can be adapted for 
use. Peptoid libraries (Simon et al., Proc. Natl. Acad. Sci. 
U.S.A. 89:9367-9371 (1992)) can also be used. Another 
example of a library that can be used, in which the amide 
functionalities in peptides have been permethylated to gen- 
erate a chemically transformed combinatorial library, is 
described by Ostresh et al. (Proc. Natl. Acad. Sci. U.S.A. 
91:11138-11142 (1994)). 

[0246] Screening the libraries can be accomplished by any 
of a variety of commonly known methods. See, e.g., the 
following references, which disclose screening of peptide 
libraries: Parmley and Smith, Adv. Exp. Med. Biol. 251:215- 
218 (1989); Scott and Smith, Science 249:386-390 (1990); 
Fowlkes et al., BioTechniques 13:422-427 (1992); Olden- 
burg et al., Proc. Natl. Acad. Sci. U.S.A. 89:5393-5397 
(1992); Yu et al., Cell 76:933-945 (1994); Staudt et al., 
Science 241:577-580 (1988); Bock et al., Nature 355:564- 
566 (1992); Tuerk et al., Proc. Natl. Acad. Sci. U.S.A. 
89:6988-6992 (1992); Ellington et al., Nature 355:850-852 
(1992); U.S. Pat. No. 5,096,815, U.S. Pat. No. 5,223,409, 
and U.S. Pat. No. 5,198,346, all to Ladner et al.; Rebar and 
Pabo, Science 263:671-673 (1993); and PCT Publication 
No. WO 94/18318, published Aug. 8, 1994. 

[0247] In a specific embodiment, screening can be carried 
out by contacting the library members with a TCAP protein 
(or nucleic acid or derivative) immobilized on a solid phase 
and harvesting those library members that bind to the protein 
(or nucleic acid or derivative). Examples of such screening 
methods, termed "panning" techniques are described by way 
of example in Parmley and Smith, Gene 73:305-318 (1988); 
Fowikes et al., BioTechniques 13:422-427 (1992); PCT 
Publication No. WO 94/18318; and in references cited 
herein above. 

[0248] In another embodiment, the two-hybrid system for 
selecting interacting proteins in yeast (Fields and Song, 
Nature 340:245-246 (1989); Chien et al., Proc. Natl. Acad. 
Sci. U.SA. 88:9578-9582 (1991)) can be used to identify 
molecules that specifically bind to a TCAP protein or 
derivative. 

[0249] In another embodiment, screening can be carried 
out by creating a peptide library in a prokaryotic or eukary- 
otic cells, such that the library proteins are expressed on the 
cells' surface, followed by contacting the cell surface with 
a TCAP and determining whether binding has taken place. 
Alternatively, the cells are transformed with a nucleic acid 
encoding a TCAP, such that the TCAP is expressed on the 
cells' surface. The cells are then contacted with a potential 
agonist or antagonist, and binding, or lack thereof, is deter- 
mined. In a specific embodiment of the foregoing, the 
potential agonist or antagonist is expressed in the same or a 
different cell such that the potential agonist or antagonist is 
expressed on the cells' surface. 

5.7.5. TRANSGENIC ANIMALS 

[0250] The invention also provides animal models. Trans- 
genic animals that have incorporated and express a consti- 
tutively-functional TCAP gene have use as animal models of 



diseases and disorders involving in T cell overactivation or 
over-proliferation, or in which cell proliferation is desired. 
Such animals can be used to screen for or test molecules for 
the ability to suppress activation and/or proliferation of T 
cells and thus treat or prevent such diseases and disorders. 
In one embodiment, animal models for diseases and disor- 
ders involving T cell activation (e.g., as described in Section 
5.7.5) are provided. Such animals can be initially produced 
by promoting homologous recombination between a TCAP 
gene in its chromosome and an exogenous TCAP gene that 
has been rendered biologically inactive. Preferably the 
sequence inserted is a heterologous sequence, e.g., an anti- 
biotic resistance gene. In a preferred aspect, this homolo- 
gous recombination is carried out by transforming embryo - 
derived stem (ES) cells with a vector containing an 
insertionally inactivated gene, wherein the active gene 
encodes a particular TCAP, such that homologous recom- 
bination occurs; the ES cells are then injected into a blas- 
tocyst, and the blastocyst is implanted into a foster mother, 
followed by the birth of the chimeric animal, also called a 
"knockout animal," in which a TCAP gene has been inac- 
tivated (see Capecchi, Science 244:1288-1292 (1989)). The 
chimeric animal can be bred to produce additional knockout 
animals. Chimeric animals can be and are preferably non- 
human mammals such as mice, hamsters, sheep, pigs, cattle, 
etc. In a specific embodiment, a knockout mouse is pro- 
duced. 

[0251] Such knockout animals are expected to develop or 
be predisposed to developing diseases or disorders involving 
T cell underproliferation and thus can have use as animal 
models of such diseases and disorders, e.g., to screen for or 
test molecules for the ability to promote activation or 
proliferation and thus treat or prevent such diseases or 
disorders. 

[0252] In a different embodiment of the invention, trans- 
genic animals that have incorporated and express a consti- 
tutively- functional TCAP gene have use as animal models of 
diseases and disorders involving in T cell overactivation, or 
in which T cell activation is desired. Such animals can be 
used to screen for or test molecules for the ability to suppress 
activation of T cells and thus treat or prevent such diseases 
and disorders. 

[0253] In particular, each transgenic line expressing a 
particular key gene under the control of the regulatory 
sequences of a characterizing gene is created by the intro- 
duction, for example by pronuclear injection, of a vector 
containing the transgene into a founder animal, such that the 
transgene is transmitted to offspring in the line. The trans- 
gene preferably randomly integrates into the genome of the 
founder but in specific embodiments may be introduced by 
directed homologous recombination. In a preferred embodi- 
ment, the transgene is present at a location on the chromo- 
some other than the site of the endogenous characterizing 
gene. In a preferred embodiment, homologous recombina- 
tion in bacteria is used for target-directed insertion of the key 
gene sequence into the genomic DNA for all or a portion of 
the characterizing gene, including sufficient characterizing 
gene regulatory sequences to promote expression of the 
characterizing gene in its endogenous expression pattern. In 
a preferred embodiment, the characterizing gene sequences 
are on a bacterial artificial chromosome (BAC). In specific 
embodiments, the key gene coding sequences are inserted as 
a 5' fusion with the characterizing gene coding sequence 
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such that the key gene coding sequences are inserted in 
frame and directly 3' from the initiation codon for the 
characterizing gene coding sequences. In another embodi- 
ment, the key gene coding sequences are inserted into the 3' 
untranslated region (UTR) of the characterizing gene and, 
preferably, have their own internal ribosome entry sequence 
(IRES). 

[0254] The vector (preferably a BAC) comprising the key 
gene coding sequences and characterizing gene sequences is 
then introduced into the genome of a potential founder 
animal to generate a line of transgenic animals. Potential 
founder animals can be screened for the selective expression 
of the key gene sequence in the population of cells charac- 
terized by expression of the endogenous characterizing 
gene. Transgenic animals that exhibit appropriate expression 
(e.g., detectable expression of the key gene product having 
the same expression pattern within the animal as the endog- 
enous characterizing gene) are selected as founders for a line 
of transgenic animals. 

[0255] Knockouts, including tissue -specific knockouts (in 
which the gene of interest is inactivated in particular tis- 
sues), can also be made by methods known in the art. 

[0256] Accordingly, the invention provides a transgenic 
animal that comprises a recombinant non-human animal in 
which a gene encoding a protein comprising SEQ ID NO: 3, 
SEQ ID NO: 4, SEQ ID NO: 7, SEQ ID NO: 9, SEQ ID NO: 
11, SEQ ID NO: 13, SEQ ID NO: 15, SEQ ID NO: 17, or 
SEQ ID NO: 19 has been inactivated by a method compris- 
ing introducing a nucleic acid into the plant or animal or an 
ancestor thereof, which nucleic acid or a portion thereof 
becomes inserted into or replaces said gene, or a progeny of 
such animal in which said gene has been inactivated. 

6. EXAMPLES 

[0257] The following examples are by way of illustration 
of the previously described invention, and are not limiting of 
that description in any way. In particular, the Examples 
presented herein below describe the analysis of the human T 
cell Activation GTPase Activating Protein and T cell Acti- 
vation G-Protein coupled Receptor. 

Example 1 

Identification of Genes Upregulated During T Cell 
Activation 

[0258] To identify genes upregulated during T cell acti- 
vation, FlexJet™ chips representing either 25,000 or 50,000 



Unigene clusters were hybridized to a mixture of cRNAs 
untreated versus treated cells of various types. FIG. 12 
depicts a series of experiments comparing activated and 
unactivated Jurkat cells, K562 cells, peripheral blood T 
cells, THP1 cells, NB4 cells, JCAM cells, HL60 cells, and 
B-lymphoblast cells. A total of 3853 genes regulated 
>3-fold, P<0.01 in a total of 104 experiments were analyzed 
by a two dimensional hierarchical clustering algorithm. This 
analysis groups genes showing the greatest similarity of 
regulation over all experiments (first dimension) and the 
experiments showing the greatest similarities in gene regu- 
lation (second dimension). For clarity, FIG. 12 depicts only 
a section of the total data set (64 genes and 94 experiments). 
Each experiment and each gene are represented on the X and 
Y axes, respectively. Experiments involving activated 
peripheral blood T cells and activated Jurkat T cells are 
indicated with horizontal black bars. Genes upregulated in a 
particular experiment are colored dark gray; genes down 
regulated in that experiment are colored light gray; and 
genes showing no regulation in a particular experiment are 
colored black. The set of genes shown here demonstrates 
enrichment for T cell cytokines. Of the 3853 genes clustered, 
24 (0.6%) encoded known cytokines. In the region shown, 
which comprises 64 genes, 9 (14%) were cytokines. Thus, 
there was a 27-fold enrichment for cytokine genes in this 
group. Known cytokine genes are highlighted with dark gray 
circles. This region also contains 21 ESTs of unknown 
function, which are indicated with black gray. 

[0259] 35 EST clusters were identified which clustered 
among T cell cytokines. When extended, these were found 
to represent 25 different transcripts. A total of 24 ESTs 
linked to known genes were identified (Table 1). Four of 
these 24 ESTs were found to map to introns of known genes. 
Ten of these 24 ESTs were found to overlap with cDNA 
sequences published during the course of this work. Fifteen 
of these ESTs were found to map in close proximity to the 
3' untranslated region (3' UTR) of known genes, and have 
been tentatively identified as extensions of these 3' UTRs. 
Three of these tentative identifications were confirmed by 
RT-PCR or genomic tiling (Bach2, TNFRSF9, IL2RA). 

[0260] The remainder identified seven novel transcripts 
encoding a new GPCR, three new potential signal transduc- 
ers (a phosphatase, a GTPase activating protein, and a 
WD-repeat containing protein); a potential NF-KB-like tran- 
scription factor, a keich motif-containing protein, and a 
leucine repeat-rich protein. These are discussed in more 
detail in Examples 3-9. 



TABLE 1 



Summary of ESTs identified as known genes by expression coregulation. 

Unigene ID 





Likely gene 


EST relationship 




Known T-Cell 


(Build #128, 




ESTs 


identity 


to gene 




activation gene 


Dec. 22, 2000) 


gDNA or new cDNA 


AA284303 


TNFSF8 


3' UTR of known 


gene 


yes 


Hs. 101370 


AL133412, NM_001244 


AI308959 


IL21R 


3' UTR of known 


gene 


yes 


Hs. 126232 


AC002303 


AI418535 


IL2RA 


3' UTR of known 


gene 


yes 


Hs. 130058 


AL137186, NM_000417 


AA211393 


TNFRSF9 


3' UTR of known 


gene 


yes 


Hs.86447 


AL009183, NM_001561 


AI624755 


TNFRSF9 


3' UTR of known 


gene 


yes 


Hs. 193418 


AL009183, NM_001561 


N63938 


Bach2 


3" UTR of known 


gene 


no 


Hs.88414 


AL353692 


AA825702 


Bach2 


3' UTR of known 


gene 


no 


Hs.88414 


AL353692 
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TABLE 1-continued 

Summary of ESTs identified as known genes by expression coregulation. 



Unigene ID 

Likely gene EST relationship Known T-Cell (Build #128, 

ESTs identity to gene activation gene Dec. 22, 2000) gDNA or new cDNA 



AA488974 


Bach2 


3' UTR of known gene 


no 


Hs.88414 


AL353692 


AA251113 


Bach2 


3' UTR of known gene 


no 


Hs.88414 


AL353692 


AI655183 


REL 


3' UTR of known gene 


yes 


Hs. 105251 


AC0 10733, NM_002908 


AI652899 


REL 


3' UTR of known gene 


yes 


Hs.86671 


AC010733, NM_002908 


AA210906 


REL 


3' UTR of known gene 


yes 


Hs. 188751 


AC010733, NM_002908 


AI497657 


GNG4 


3' UTR of known gene 


no 


Hs. 135184 


AL162611, NM_004485 


AI608902 


B7-H1 


3' UTR of known gene 


yes 


Hs.106149 


NM_014143 


AI683598 


HSP105B 


3' UTR of known gene 


no 


Hs. 20 16 15 


AL137142, NM_006644 


AI439019 


TBX21 


Identical to new cDNA 


yes 


Hs. 272409 


NM_013351 


U19261 


TRAF1 


Identical to new cDNA 


yes 


Hs.2134 


NM_005658 


AI201323 


G18 


Identical to new cDNA 


no 


Hs.8257 


NM_013324 


AI377661 


PLSCR2 


Identical to new cDNA 


no 


Hs. 123411 


NM_020359 


AI148659 


Fibronectin 1 


Identical to new cDNA 


yes 


Hs. 287820 


AC026342, NM002153 


AI073984 


ICSPB1 


Identical to new cDNA 


yes 


Hs. 14453 


NM_002163 


AI681868 


PBEF 


Intro n of known gene 


yes 


Hs. 178784 


AC007032 


AI092511 


CD26 


Intron of known gene 


yes 


Hs. 134533 


AC008063 


AA708350 


CDK6 


Intron of known gene 


yes 


Hs.189016 


AC000065, NM_001259 



Example 2 

Linkage of Exons Into Unigene Clusters by Array 
Expression Profiling 

[0261] For details of the array-based techniques of exon 
clustering, mapping and extension using ESTs, see U.S. pat. 
app. No. 09/781,814. 

[0262] FIG. 13 depicts the use of array data to assign 
different sequences to the same transcript. Consensus 
sequences from two previously unlinked Unigene clusters, 
Hs. 7581 and Hs. 130864 (FIG. 13A) were mapped to a 
portion of human chromosome 6 as follows. FlexJetTM 
scanning arrays were synthesized specifying alternating 
sense and antisense oligonucleotides from every tenth nucle- 
otide position in a genomic region encoding Unigene EST 
clusters, Hs. 7581 and Hs. 130864 on chromosome 6. 
Repetitive sequences in the genomic sequence were masked 
with the software program "RepeatMasker". Nested 60 mer 
oligonucleotides were selected from every tenth position of 
both strands of non-repetitive sequence. FIG. 13B shows the 
array hybridized with a mixture of cRNA from activated 
(labeled with red fluorescent dye) and unactivated (labeled 
with green dye) Jurkat cells. Cells were activated by incu- 
bation for 4 hrs at 37° C. on plastic culture flasks coated with 
anti-TCR Vbeta8 monoclonal antibody (mAb) (Pharmin- 
gen), in the presence of PMA (10 nM) and soluble anti- 
CD28 (mAb) 9.3 jug/ml. Array data (FIG. 13B) showed 
contiguous hybridization, suggesting that this region, and 
therefore Hs. 7581 and Hs. 130864, hybridize with a single 
transcript. 

[0263] The correlation between Hs. 7581 and Hs. 130864 
was determined by XDEV measurements of hybridization 
over the region of chromosome 6 adjacent to Unigene 
clusters (FIG. 13C) Hs. 7581 and Hs. 130864. XDEV is a 
statistic denning the significance of a hybridization ratio in 
a two-color experiment: 



[0264] where a 12 are the intensities measured in the two 
channels for each spot, p 1 2 are the uncertainties due to 
background subtraction, and f is a fractional multiplicative 
error such as would come from hybridization non-unifor- 
mities, fluctuations in the dye incorporation efficiency, scan- 
ner gain fluctuations, etc. Higher XDEV measurements 
represent more significant hybridization ratios. The region in 
FIG. 13B between the white circles corresponds to the peak 
of XDEV measurements. 

[0265] The linkage of these EST clusters was confirmed 
by RT-PCR analysis. Further extension of these EST clusters 
by RT-PCR analysis revealed that this genomic region 
represents an exon from the 3' untranslated region of the 
human homolog of the transcription factor, Bach2. 

Example 3 

Identification of TA-GAP 

[0266] The cloning of the gene encoding human T cell 
activation-associated GTPase activating protein (TA-GAP), 
and analysis of the protein, was accomplished as follows. 

[0267] Human peripheral blood mononuclear cells were 
activated for 5 days with phytohemaglutinin (PHA), rested 
for one day in medium lacking PHA, and re stimulated for 
the various periods of time on anti-CD3 (Pharmingen) 
coated plastic wells. At the indicated times, cells were 
harvested, cellular RNA was prepared, and amplified into 
cRNA. Hybridizations to human 25 k gene chips were 
performed with a mixture of cRNA from activated cells (red 
dye) and unactivated cells (green dye). FIG. 14 shows the 
time course of genes upregulated or downregulated during T 
cell activation. Transcripts showing significant regulation 
(>2-fold change and P<0.0001 in most samples) are shaded 
light gray. Transcripts encoding GAP-domain-containing 
proteins are depicted as dark gray lines. The TA-GAP 
transcript is depicted by the thick dark gray line, and 
transcripts for 18 other GAP domain-containing proteins 
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(KIAA1501, KIAA0660, AI479025, ABR, GIT1, GIT2, 
ARHGAP1, ARHGAP4, G38P, GAPCENA, GAPL, 
IQGAP1, IQGAP2, NGAP, RAB3GAP, RANGAP1, 
RAP1GA1, RASA1) are depicted by thin dark gray lines. Of 
the transcripts tested that encode GAP-domain containing 
proteins, TA-GAP is the only one to show significant 
up regulation upon T cell activation. 

[0268] TA-GAP was identified by investigation of a tran- 
script corresponding to an EST, AI253155. AI253155 was 
found to be coregulated with T cell cytokine transcripts, and 
was homologous to a genomic clone, AL035530, on chro- 
mosome 6q25.3-27. An ENSEMBL predicted transcript, 
ENST00000037330, mapped 5' to EST AI253155. The 
predicted transcript encoded a protein having homology to a 
GTPase -activator protein domain. cDNA corresponding to 
actual transcripts was amplified by RT-PCR using RNAfrom 
activated Jurkat cells as template, cloned and subjected to 
DNA sequence analysis. 

[0269] Two cDNA sequences were identified (Table 2). 
The nucleotide sequences of the cDNAs were used to query 
the GenBank sequence database operated by the National 
Library of Medicine, in a BLAST (Basic Local Alignment 
Search Tool) search. A BLAST search returns an Expect (E) 
value; the E value is the probability that a particular search 
result would have occurred by chance. Highly significant E 
values are greatly smaller than 1.0 (but larger than 0.0), 
while insignificant E values are close to 1.0. Similarity of 
protein sequences was calculated after the manner of 
BLAST 2.0. Specifically, Amino acids paired by sequence 
alignment were compared using the BLOSUM62 scoring 
matrix (for a methods review, see: W R Pearson. Effective 
protein sequence comparison. Methods Enzymol 
1996;266:227-58). BLOSUM62 is a rectangular matrix of 
values placed on each pair of aligned amino acids. The 
amino-acid pair values are designed to reflect the likelihood 
of amino acid replacement in conserved proteins. Positive 



scores are given to identities and conservative substitutions. 
Zero or negative scores are given for nonconservative sub- 
stitutions. 

[0270] For the purposes of generating these numbers, the 
column corresponding to each patent-sequence amino acid 
was found in the BLOSUM62 matrix. The appropriate row 
of BLOSUM62 was found for each aligned amino acid in the 
target sequence. The score at the intersection of the row and 
column was examined. If the number was positive, the 
amino acids were determined to be similar. If it was nega- 
tive, the amino acids were determined not to be similar. 
Similarites were summed across alignments in the same 
manner as identities were summed. Amino acid sequences of 
the predicted protein products were compared to entries in 
two protein motif databases, Pfam and PROSITE. A Pfam 
score close to 0.0 indicates that the match(es) returned is 
highly significant. 

[0271] The first cDNA sequence (Table 2: SEQ ID NO: 1) 
contained a full open reading frame that encoded a protein 
identical to the predicted protein from the ENSEMBL pre- 
dicted transcript, but contained an additional 105 amino 
acids at the amino terminus (Table 3, SEQ ID NO: 3). The 
second cDNA sequence was a putative splice variant (Table 
2, SEQ ID NO: 2), which contained a full open reading 
frame, but which encoded a smaller protein identical to the 
ENSEMBL predicted protein (Table 3, SEQ ID NO: 4). 
SEQ. ID NO: 1 aligned with its putative translation product 
SEQ ID NO: 2, and SEQ ID NO: 3 aligned with its putative 
translation product SEQ ID NO: 4, are depicted in FIGS. 
1A-1E and 2A-2D, respectively. Analysis of the TA-GAP 
transcript during T cell activation revealed that it was 
transiently expressed and reached maximal levels after 
approximately four hours of activation (FIG. 14). There 
were 18 other GAP domain genes represented on the human 
25 k chip used in these experiments, and TA-GAP was more 
highly regulated than any of these (FIG. 13). 



TABLE 2 



BLAST results for two TA-GAP-encoding cDNA sequences. 



Novel cDNA Polypeptide Novel cDNA Blast 
SEQ ID NO SEQ ID NO Score 



Novel cDNA Blast 
Description 



125 bp % 275 bp % 100% 
Identity Identity Identity Length 



3947 E = 0 AL035530.1 Human 

DNA 

sequence from clone RPI 
(genomic BAC clone) 

393 E = le-106 (exon 7) 

294 E = 2e-55 (exon 8) AK025272 

Homo sapiens 
FLJ21619 fis 
AL0355350.1 Human 
DNA 

sequence from clone RPI 
(genomic BAC clone) 



3947 E = 0 



393 E = le-106(exon 7) 
224 E = 2e-55 (exon 8) 



AK025272 

Homo sapiens cDNA: 
FLJ21619 fis 



100%. 



100% 



100% 



100% 



100% 



72% 



100% 



72% 



1991 



198 



1991 



198 
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[0272] 

TABLE 3 



Protein database search results for two TA-GAP variants. 













Pro site 


100% Identity 


100% Similarity 


SEQ ID NO 


Blast Scor 


e 


Blast Description 


Pfam Motif(s) 


Motif(s) 


Length 


Length 


3 


1. 161, E . 


= 3e-38 


1. BAA92629.1 
(AB037812) 
KIAA1391 protein 
[Homo sapiens] 


RhoGAP domain 
(from residue 101 to 
residue 250) score = 
101,3, E = 8.1e-28 


None 

remarkable 


5 


10 




2. 93, E = 


le-17 


2, A49678 GTPase- 
activating protein 
RhoGAP 






7 


11 


4 


1. 65, E = 


23-09 


1. BAA92629.1 
(AB037812) 
KIAA1391 protein 
[Homo sapiens] 


RhoGAP domain 
(from residue 6 to 
residue 96) score = 
-59.9, E = 0.31 


None 

remarkable 


4 


9 




2. 45.1, E 


= 2e-03 


2. NP_061830 
SH3- domain 
binding protein 1) 






3 


8 



Example 4 

Identification of TA-GPCR 

[0273] The cloning of the gene encoding human T cell 
activation associated G protein-coupled receptor (TA- 
GPCR), and analysis of the protein, was accomplished as 
follows. 

[0274] Analysis of the TA-GPCR transcript during T cell 
activation revealed that it reached maximal levels after 
approximately six hours of activation (FIG. 15). 27 other 
GPCR genes were represented on the human 25 k chip used 
in these experiments, and TA-GPCR was more highly regu- 
lated than any of these. Transcripts showing significant 
regulation (>2-fold change and P<0.0001 in most samples) 
in the experiment shown in FIG. 15 are depicted as thin gray 
lines. Transcripts encoding GPR proteins are colored red. 
The TA-GPCR transcript is depicted by the thick dark gray 
line, and transcripts for 27 other GPCR proteins are depicted 
by thin dark gray lines (GPR39, GPR51, AI61367, 
AI208357, GPRK6, GPRK5, GPR51, GPR19, AI659657, 
GPR48, EBI2, GPRK5, GPRK6, GPR68, GPR4, GPR9, 
LANCL1, CCR1, CCR4, CCR5, CCR7, CCR8, CMKLR1, 
CXCR4, HM74, LTBR4, AA040696). Of the transcripts 
tested that encode GPCRs, the ones encoding TA-GPCR 
were the only ones to show significant upregulation. 



[0275] TA-GPCR was identified by investigation of a 
transcript corresponding to an EST, AA040696. AA040696 
was coregulated with T cell cytokine transcripts, and was 
homologous to a genomic clone, AC026331, on chromo- 
some 12. An ENSEMBL predicted transcript, 
AC026331. 00004.443292, mapped 5' to EST AA040696. 
The predicted transcript encoded a protein having homology 
to a novel GPR. cDNAs corresponding to actual transcript(s) 
were amplified by RT-PCR from RNA isolated from acti- 
vated Jurkat cells as template, cloned and subjected to DNA 
sequence analysis. Two cDNA sequences (SEQ ID NOS: 5, 
6) were identified, in roughly equivalent amounts. Both 
contained a full open reading frame, and both encoded a 
protein (SEQ ID NO: 7) identical to the predicted protein 
from the ENSEMBL predicted transcript. Alignment of the 
predicted ORF of SEQ ID NOS: 5 and 6 with the putative 
translation product SEQ ID NO: 7 are shown in FIGS. 
3A-3E and 4A-4C, respectively. Nucleic acid and amino 
acid sequence comparisons, performed as described in 
Example 3, revealed that the cDNAs and predicted protein 
product had high sequence homology to G protein-coupled 
receptors (Tables 4, 5). Based on BLAST search results, 
TA-GPCR is a Class A GPCR. 



TABLE 4 



BLAST search results for two TA-GPCR- encoding cDNA sequences. 

Novel cDNA Polypeptide Novel cDNA Blast Novel cDNA Blast 125 bp % 275 bp % 100% 

SEQ ID NO SEQ ID NO Score Description Identity Identity Identity Length 



5 7 357, E = 3e-95 AL354720.14 Human 93.6% 90.6% 49 

DNA sequence from 
clone RP11-5-5F3) 

351 E = 3e-95 AC005529.7 94.4% 90.9% 52 
Homo sapiens 
chromosome 22ql2 clone 

6 7 349, E = 4e-93 AL109923.29 Human 94.4% 91.3% 34 

DNA sequence from 
clone RP3-46801 
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TABLE 4-continued 



BLAST search results for two TA-GPCR-encoding cDNA sequences. 

Novel cDNA Polypeptide Novel cDNA Blast Novel cDNA Blast 125 bp % 275 bp % 100% 

SEQ ID NO SEQ ID NO Score Description Identity Identity Identity Length 

345 E = 6e-92 AC005912.1 92% 90.9% 29 

Homo sapiens 
chromosome 
12pl3.3 BAC 
RPCI11-543P15 



[0276] 

TABLE 5 

Protein database search results for two TA-GPCR Variants. 

100% Identity 100% Similarity 



SEQ ID NO Blast Score 


Blast Description 


Pfam Motif(s) 


Pro site Motif (s) 


Length 


Length 


7 325 


NP_00 6009.1 


7tm_l, 7 


Residues 107-123, 


14 


25 


E = 7e-88 


putative chemokine 


transmembrane 


PDOC00210 PS00237 








receptor (HM74) 


receptor (rhodopsin 


G_PROTEIN_ 










family) domain 


RECEP_F1_ 










(residues 32-202), 


IG-protein 










score = 95.1, 


coupled receptors 










E = 5e-21 


family 1 signature 






320, E = 2e-86 


AJ300198 












Putative seven 












transmembrane 












spanning receptor 











[0277] TA-GPCR and the other indicated GPRs were 
subjected to multiple sequence alignment using BlockMaker 
(available on the Internet at blocks.fhcrc.org). This sequence 
comparison of the amino acid sequence of TA-GPCR with 
that of other G protein-coupled receptors revealed that 
TA-GPCR was more closely related to adenosine receptors 
than chemokine receptors. 



ID NO: 9 are shown in FIG. 5. Nucleic acid and amino acid 
sequence comparisons, performed as described in Example 
3, revealed that the predicted protein product contained a 
sequence at amino acid residues 128-172 homologous to a 
protein phosphatase class 2C domain (Tables 6, 7). 
TA-PP2C is predicted to be a serine -threonine class 2C 
phosphatase. 



TABLE 6 





BLAST search results for a TA-PP2C- encoding cDNA sequence. 




Novel cDNA 
SEQ ID NO 


Polypeptide Novel cDNA Novel cDNA 125 bp % 275 bp % 
SEQ ID NO Blast Score Blast Description Identity Identity 


100% Identity 
Length 


8 


9 255, E = 0 ACC002350 100% 100% 
Human chr 12q24 
PAC RPC13- 
424M6 


2666 



Example 5 

Identification of TA-PP2C 

[0278] The cloning of a cDNA encoding human T cell 
activation associated serine-threonine class 2C phosphatase 
(TA-PP2C), and analysis of the encoded protein, was 
accomplished generally as described in Examples 3 and 4. A 
cDNAof 3748 nucleotides (SEQ ID NO: 8) was identified, 
which contained a full open reading frame predicted to 
encode a protein of 304 amino acids (SEQ ID NO: 9). An 
alignment of SEQ ID NO: 8 and its predicted product SEQ 



[0279] 

TABLE 7 



Protein database search results for TA-PP2C. 

100% 

SEQ 100% Simi- 

ID Blast Blast Pfam Prosite Identity larity 

NO Score Description Motif(s) Motif(s) Length Length 



9 255, AAF47506 Protein N/A 13 22 

E = 6e-67 CG12091 phospha- 
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TABLE 7-continued 



Protein database search results for TA-PP2C. 



SEQ 
ID Blast 
NO Score 



Blast 

Description 



Pfam 
Motif(s) 



Prosite 
Motif(s) 



100% 
100% Simi- 
Identity larity 
Length Length 



gene product tase 2C 
(Drosophila) domain, 
residues 
128-172 



Example 6 

Identification of TA-NFKIBH 

[0280] The cloning of a cDNA encoding human T cell 
activation associated NF-KB-like transcription factor (TA- 



NFKBH), and analysis of the encoded protein, was accom- 
plished generally as described in Examples 3 and 4. AcDNA 
of 1736 nucleotides (SEQ ID NO: 10) and a cDNAof 1834 
nucleotides were identified (SEQ ID NO: 12), which con- 
tained a full open reading frames predicted to encode 
proteins of 465 amino acids (SEQ ID NO: 11) and 313 
amino acids (SEQ ID NO: 13), respectively. The short 
variant has the same amino acid sequence as SEQ ID NO: 
11, amino acids 153-465. An alignment of SEQ ID NO: 10 
to SEQ ID NO: 11, and SEQ ID NO: 12 to SEQ ID NO: 13 
are shown in FIGS. 6 and 7, respectively. Nucleic acid and 
amino acid sequence comparisons, performed as described 
in Example 3, revealed that both predicted protein products 
had Ank (ankyrin-like) repeats, which are involved in pro- 
tein-protein interactions. The long form has Ank repeats at 
residues 200-439, particularly in 236-268, 269-301 and 
395-431. Both forms show sequence homology to NF-kB or 
to MAIL, a murine kB transcriptional activator (Tables 8, 9). 

-like transcription 

factor. 



TABLE 8 



BLAST search results for two TA-NFKBH-encoding cDNA sequences. 

Novel cDNA Polypeptide Novel cDNA Novel cDNA 125 bp % 275 bp % 100% Identity 

SEQ ID NO SEQ ID NO Blast Score Blast Description Identity Identity Length 



10 



12 



11 (long) 739, E = 0 



13 (short) 739, E = 0 



AD000864 
Human DNA 
sequence from 
chr. 19, cosmid 
R28051 
AD000864 
Human DNA 
sequence from 
chr. 19, cosmid 
R28051 



100% 



100% 



100% 



100% 



373 



373 



[0281] 

TABLE 9 





Protein database search results for two TA-NFKBH variants. 












100% Identity 


100% Similarity 


SEQ ID NO Blast Score 


Blast Description Pfam Motif (s) 


Prosite Motif(s) 


Length 


Length 


11 (long) 179, E = 8e-44 


BAB 18302 MAIL 5 Ank repeats 


Proline-rich region, 


11 


12 




(Mus musculus) 


residues 70-177 






87, E = 4e-16 


NP_003989 


Ank repeats, residues 


6 


7 




NF-kappa B pl05 


200-439, 236-268, 








homolog 


269-301, 395-431 






13 (short) 197, E = 2e-49628, 


BAB 18302 MAIL 5 Ank repeats, 


Ank repeats 


11 


12 


E = e-179 


(Mus musculus) 84-279 








100, E = 2e-20 


NP_005 169.1 




5 


12 




B-cell 










CLL/lymphoma 
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Example 7 

Identification of TA-WDRP 

[0282] The cloning of a cDNA encoding human T cell 
activation associated transducin-like protein with WD 
motifs (TA-WDRP), and analysis of the encoded protein, 
was accomplished generally as described in Examples 3 and 
4. A cDNA of 3049 nucleotides (SEQ ID NO: 14) was 
identified, which contained a full open reading frame pre- 
dicted to encode a protein of 951 amino acids (SEQ ID NO: 
15). An alignment of SEQ ID NO: 14 to SEQ ID NO: 15 is 
shown in FIG. 8. Nucleic acid and amino acid sequence 
comparisons, performed as described in Example 3, revealed 
that the cDNAs and predicted protein product had sequence 
homology to transducins, which are G-proteins (Tables 10, 
11). TA-WDRP is also predicted to contain a WD motif 
repeats at amino acid residues 116-149, 180-216, 223-259, 
362-398, 407-443, 449-484, and 490-526. 



Example 8 

Identification of TA-KRP 

[0284] The cloning of a cDNA encoding human T cell 
activation associated kelch-like transcription factor (TA- 
KRP), and analysis of the encoded protein, was accom- 
plished generally as described in Examples 3 and 4. AcDNA 
of 4617 nucleotides (SEQ ID NO: 16) was identified, which 
contained a full open reading frame predicted to encode a 
protein of 575 amino acids (SEQ ID NO: 17). An alignment 
of SEQ ID NO: 16 to SEQ ID NO: 17 is shown in FIG. 9. 
Nucleic acid sequence comparisons, performed as described 
in Example 3, revealed that the predicted protein product 
contained a BPOZ/TB domain at residues 138-252, charac- 
teristic of a class of transcription regulatory proteins 
(Ahmad et al., Proc. Natl. Acad. Sci. U.S.A. 95:12123- 
12128 (1998)) (Tables 12, 13). The protein also contains 
four kelch repeats. 



TABLE 10 



BLAST search results for a TA-WDRP-encoding cDNA sequence. 



Novel cDNA Polypeptide Novel cDNA Novel cDNA 125 bp % 275 bp % 100% Identity 

SEQ ID NO SEQ ID NO Blast Score Blast Description Identity Identity Length 



14 15 890, E = 0 AC020925 Chr. 5 100% 100% 449 

clone CTD- 
2134K2 



[0283] 



TABLE 11 



Protein database search results for TA-WDRP. 



SEQ ID NO Blast Score 



Blast 100% Identity 100% Similarity 

Description Pfam Motif(s) Prosite Motif(s) Length Length 



15 628, E = e-179 AAF54941 

(AE003700) 
CG9799 
(Drosophila) 

494, E = e-138 CAB81036 
(AL161502) 
putative 
WD-repeat 
membrane 
protein 
(Arabidopsis) 



G-protein beta AMP- 
WD-40 repeats dependent 
(PF0400) synthetase 
and ligase 
(PS00455) 
G-protein beta 
WD-40 repeats 
(S00167, 
PS50082, 
PS50294) 



8 



19 



23 
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TABLE 12 



BLAST search results for a TA-KRP-encoding cDNA sequence. 

Novel cDNA Polypeptide Novel cDNA Novel cDNA 125 bp % 275 bp % 100% Identity 

SEQ ID NO SEQ ID NO Blast Score Blast Description Identity Identity Length 

16 17 4339, E = 0 AC020655 100% 100% 3290 

human BAC 
RP11-15B4 



[0285] 

TABLE 13 



Protein database search results for TA-KRP. 

100% Identity 100% Similarity 
SEQ ID NO Blast Score Blast Description Pfam Motif(s) Prosite Motif(s) Length Length 



17 221, E = 3e-56 Kiaal489 human Kelch repeat BTB/POZ domain 10 13 

protein (PF01344) (PS50097) 

187, E = 5e-46 NP006054 BTB/POZ domain N/A 5 9 

sarcomeric (PF00651) 
muscle protein 



Example 9 

Identification of TA-LRRP 

[0286] The cloning of a cDNA encoding a human T cell 
activation associated leucine repeat-rich protein (TA-KRP), 
and analysis of the encoded protein, was accomplished 
generally as described in Examples 3 and 4. AcDNAof 3588 
nucleotides (SEQ ID NO: 18) was identified, which con- 



tained a full open reading frame predicted to encode a 
protein of 803 amino acids (SEQ ID NO: 19). An alignment 
of SEQ ID NO: 18 to SEQ ID NO: 19 is shown in FIG. 10. 
Nucleic acid sequence comparisons, performed as described 
in Example 3, revealed that the predicted protein product 
contained 12 leucine-rich repeats, as well as a bipartite 
nuclear localization signal at residues 228-245 (Tables 14, 
15). 



TABLE 14 



BLAST search results for a TA-LRRP-encoding cDNA sequence. 

100% Identity 

Novel cDNA Polypeptide Novel cDNA Novel cDNA 125 bp % 275 bp % Length 

SEQ ID NO SEQ ID NO Blast Score Blast Description Identity Identity (nucleotides) 



18 19 3457, E = 0 AD00864 100% 100% 1765 

Human DNA 
sequence from 
chr. 19, 

cosmid R28051 



[0287] 

TABLE 15 



Protein database search results for TA-LRRP. 



SEQ Blast Blast Pfam Prosite 100% Identity 100% Similarity 

ID NO Score Description Motif(s) Motif(s) Length Length 



19 850, BAA92675 12 leucine Bipartite nuclear 16 38 

E = 0 (AB037858) rich localization signal 

KIAA1437 repeats 
(H. sapiens) 
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7. REFERENCES CITED 

[0288] All references cited herein are incorporated herein 
by reference in their entirety and for all purposes to the same 
extent as if each individual publication or patent or patent 
application was specifically and individually indicated to be 
incorporated by reference in its entirety for all purposes. 



[0289] Many modifications and variations of the present 
invention can be made without departing from its spirit and 
scope, as will be apparent to those skilled in the art. The 
specific embodiments described herein are offered by way of 
example only, and the invention is to be limited only by the 
terms of the appended claims along with the full scope of 
equivalents to which such claims are entitled. 



SEQUENCE LISTING 

<160> NUMBER OF SEQ ID NOS : 19 

<2 10> SEQ ID NO 1 

<2 11> LENGTH: 3218 

<2 12> TYPE: DNA 

<2 13> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 1 



gacatagctg 


ccctaaaagg 


aatgaggaag 


cgagagctct 


ccagtgtctg 


gctggctccg 


60 


tccgtgtgac 


agcccatgat 


gttctttccg 


gtctctgtaa 


tattctgaat 


ttccacctgc 


120 


ccgccccttc 


gcttataatg 


cagagcatgt 


gaagggagac 


cggctcggtc 


tctctctctc 


180 


ccagtggact 


agaaggagca 


gagagttatg 


ctgtttctcc 


cattctttac 


agctcaccgg 


240 


atgtaaaaga 


actctggcta 


gagaccctcc 


aaggacagag 


gcacagccac 


acgggagtga 


300 


aatccacccc 


tggacagtca 


gccgcaatac 


tgatgaagct 


gagaagcagc 


cacaatgctt 


360 


caaaaacact 


aaacgccaat 


aatatggaga 


cactaatcga 


atgtcaatca 


gagggtgata 


420 


tcaaggaaca 


tcccctgttg 


gcatcatgtg 


agagtgaaga 


cagtatttgc 


cagctcattg 


480 


aagttaagaa 


gagaaagaag 


gtgctgtcct 


ggccctttct 


catgagaagg 


ctctcccctg 


540 


catcagattt 


ttctggggct 


ttggagacag 


acttgaaagc 


atcgctattt 


gatcagccct 


600 


tgtcaattat 


ctgcggtgac 


agtgacacac 


tccccagacc 


catccaggac 


attctcacta 


660 


ttctatgcct 


taaaggccct 


tcaacggaag 


ggatattcag 


gagagcagcc 


aacgagaaag 


720 


cccgtaagga 


gctgaaggag 


gagctcaact 


ctggggatgc 


ggtggatctg 


gagaggctcc 


780 


ccgtgcacct 


cctcgctgtg 


gtctttaagg 


acttcctcag 


aagtatcccc 


cggaagctac 


840 


tttcaagcga 


cctctttgag 


gagtggatgg 


gtgctctgga 


gatgcaggac 


gaggaggaca 


900 


gaatcgaggc 


cctgaaacag 


gttgcagata 


agctcccccg 


gcccaacctc 


ctgctactca 


960 


agcacttggt 


ctatgtgctg 


cacctcatca 


gcaagaactc 


tgaggtgaac 


aggatggact 


1020 


ccagcaatct 


ggccatctgc 


attggaccca 


acatgctcac 


cctggagaat 


gaccagagcc 


1080 


tgtcatttga 


agcccagaag 


gacctgaaca 


acaaggtgaa 


gacactggtg 


gaattcctca 


1140 


ttgataactg 


ctttgaaata 


tttggggaga 


acattccagt 


gcattccagt 


atcacttctg 


1200 


atgactccct 


ggagcacact 


gacagttcag 


atgtgtcgac 


cctgcagaat 


gactcagcct 


1260 


acgacagcaa 


cgaccctgat 


gtggaatcca 


acagcagcag 


tggcatcagc 


tctcccagca 


1320 


ggcagcccca 


ggtgcccatg 


gccacagctg 


ctggcttgga 


tagcgcgggc 


ccacaggatg 


1380 


cccgagaggt 


cagcccagag 


cccattgtga 


gcaccgtggc 


caggctgaaa 


agctccctcg 


1440 


cacagcccga 


taggagatac 


tcagagccca 


gcatgccatc 


ctcccaggag 


tgcctcgaga 


1500 


gccgggtgac 


aaaccaaaca 


ctaacaaaga 


gtgaagggga 


cttccccgtg 


ccccgggtag 


1560 
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gctctcgttt 


ggaaagtgag 


gaggctgaag 


acccatttcc 


agaggaggtc 


ttccctgcag 


1620 


tgcaaggcaa 


aaccaagagg 


ccggtggacc 


tgaagatcaa 


gaacttggcc 


ccgggttcgg 


1680 


tgctcccgcg 


ggcactggtt 


ctcaaagcct 


tctccagcag 


ctcgctggac 


gcgtcctctg 


1740 


acagctcgcc 


cgtggcttct 


ccttccagtc 


ccaaaagaaa 


tttcttcagc 


agacatcagt 


1800 


ctttcaccac 


aaagacagag 


aaaggcaagc 


ccagccgaga 


aattaaaaag 


cactccatgt 


1860 


ctttcacctt 


tgcccctcac 


aaaaaagtgc 


tgaccaaaaa 


cctcagcgcg 


ggctctggga 


1920 


aatcgcaaga 


ctttaccagg 


gaccacgtcc 


cgaggggtgt 


cagaaaggaa 


agccagcttg 


1980 


ccggccgaat 


cgtgcaggaa 


aatgggtgtg 


aaacccacaa 


ccaaacagcc 


cgcggcttct 


2040 


gcctgagacc 


ccacgccctc 


tcggtggatg 


atgtgttcca 


gggagctgac 


tgggagaggc 


2100 


ctggaagccc 


accctcttat 


gaagaggcca 


tgcagggccc 


ggcagccaga 


ctagtggcct 


2160 


ccgagagcca 


gaccgtgggg 


agcatgacgg 


tggggagcat 


gagggcgagg 


atgctggagg 


2220 


cgcactgcct 


cctaccccct 


cttccacctg 


ctcaccacgt 


agaggactca 


agacacaggg 


2280 


gcagcaaaga 


gccactccct 


ggccacggac 


tctctcccct 


gcctgagcga 


tggaaacaga 


2340 


gcagaactgt 


ccatgcttct 


ggggactctc 


tggggcacgt 


gtctggccca 


gggagacctg 


2400 


agctcctccc 


gctgaggacc 


gtctccgagt 


ccgtgcagag 


gaataagcgg 


gactgtctcg 


2460 


tgcgacgatg 


tagccagccg 


gtctttgagg 


ctgaccaatt 


ccaatatgcc 


aaagaatcgt 


2520 


atatttagga 


gggaggccat 


acgccatgcc 


atagcttgtg 


ctatctgtaa 


atatgagact 


2580 


tgtaaagaac 


tgcctgtaga 


ttgtttttaa 


aaggtcttga 


ataagctcct 


tgagaaagtt 


2640 


gtggaaagcc 


ctcctcagtg 


aggatagcta 


caccatggcc 


atggcgcatc 


agatagtctc 


2700 


tgtgtacctg 


gatttgtgca 


atatgtaaaa 


atgtatcaaa 


tgtattatag 


ataaggtgtt 


2760 


aggtgcaaag 


gatgtctaat 


aatccctgca 


cacgttttga 


acttgcagtg 


aagtacactg 


2820 


ctgttccttg 


cttcctgggg 


cacttttctc 


ttggttagtg 


tttaaaaatt 


atcttcgctt 


2880 


ttttaatgtg 


gcctcaaatg 


tcatgccaat 


tttcacatct 


tccacaaact 


ccatttaggg 


2940 


agaaatgttt 


aaatctctgg 


tataagttta 


ctccatacca 


gagtaaacta 


tatattactc 


3000 


tatataagca 


gtcttgcaat 


aactaatcac 


caccatagaa 


gaaagaaaca 


gactgcaagg 


3060 


aacagagttg 


agtgtctgga 


gtcatcaaag 


gcattaaaaa 


ctccagtaaa 


agctggggcc 


3120 


gtagcaaaaa 


tcatgaaaaa 


cacttcaacg 


tgtcctttca 


atcatccaat 


taaatgtggg 


3180 


tagattaatg 


aaaatgtatt 


acatcaatat 


taactcat 






3218 


<210> SEQ ID NO 2 

<211> LENGTH: 3 05 1 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 










<4 00> SEQUENCE: 2 












gacatagctg 


ccctaaaagg 


aatgaggaag 


cgagagctct 


ccagtgtctg 


gctggctccg 


60 


tccgtgtgac 


agcccatgat 


gttcttt.ccg 


gtctctgtaa 


tattctgaat 


ttccacctgc 


120 


ccgccccttc 


gcttataatg 


cagagcatgt 


gaagggagac 


cggctcggtc 


tctctctctc 


180 


ccagtggact 


agaaggagca 


gagagttatg 


ctgtttctcc 


cattctttac 


agctcaccgg 


240 


atgtaaaaga 


actctggcta 


gagaccctcc 


aaggacagag 


gcacagccac 


acgggagtga 


300 


aatccacccc 


tggacagtca 


gccgcaatac 


tgatgaagct 


gagaagcagc 


cacaatgctt 


360 
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caaaaacact 


aaacgccaat 


aatatggaga 


cactaatcga 


atgtcaatca 


gagggtgata 


420 


tcaaggaaca 


tcccctgttg 


gcatcatgtg 


agagtgaaga 


cagtatttgc 


cagctcattg 


480 


gacattctca 


ctattctatg 


ccttaaaggc 


ccttcaacgg 


aagggatatt 


caggagagca 


540 


gccaacgaga 


aagcccgtaa 


ggagctgaag 


gaggagctca 


actctgggga 


tgcggtggat 


600 


ctggagaggc 


tccccgtgca 


cctcctcgct 


gtggtcttta 


aggacttcct 


cagaagtatc 


660 


ccccggaagc 


tactttcaag 


cgacctcttt 


gaggagtgga 


tgggtgctct 


ggagatgcag 


720 


gacgaggagg 


acagaatcga 


ggccctgaaa 


caggttgcag 


ataagctccc 


ccggcccaac 


780 


ctcctgctac 


tcaagcactt 


ggtctatgtg 


ctgcacctca 


tcagcaagaa 


ctctgaggtg 


840 


aacaggatgg 


actccagcaa 


tctggccatc 


tgcattggac 


ccaacatgct 


caccctggag 


900 


aatgaccaga 


gcctgtcatt 


tgaagcccag 


aaggacctga 


acaacaaggt 


gaagacactg 


960 


gtggaattcc 


tcattgataa 


ctgctttgaa 


atatttgggg 


agaacattcc 


agtgcattcc 


1020 


agtatcactt 


ctgatgactc 


cctggagcac 


actgacagtt 


cagatgtgtc 


gaccctgcag 


1080 


aatgactcag 


cctacgacag 


caacgaccct 


gatgtggaat 


ccaacagcag 


cagtggcatc 


1140 


agctctccca 


gcaggcagcc 


ccaggtgccc 


atggccacag 


ctgctggctt 


ggatagcgcg 


1200 


ggcccacagg 


atgcccgaga 


ggtcagccca 


gagcccattg 


tgagcaccgt 


ggccaggctg 


1260 


aaaagctccc 


tcgcacagcc 


cgataggaga 


tactcagagc 


ccagcatgcc 


atcctcccag 


1320 


gagtgcctcg 


agagccgggt 


gacaaaccaa 


acactaacaa 


agagtgaagg 


ggacttcccc 


1380 


gtgccccggg 


taggctctcg 


tttggaaagt 


gaggaggctg 


aagacccatt 


tccagaggag 


1440 


gtcttccctg 


cagtgcaagg 


caaaaccaag 


aggccggtgg 


acctgaagat 


caagaacttg 


1500 


gccccgggtt 


cggtgctccc 


gcgggcactg 


gttctcaaag 


ccttctccag 


cagctcgctg 


1560 


gacgcgtcct 


ctgacagctc 


gcccgtggct 


tctccttcca 


gtcccaaaag 


aaatttcttc 


1620 


agcagacatc 


agtctttcac 


cacaaagaca 


gagaaaggca 


agcccagccg 


agaaattaaa 


1680 


aagcactcca 


tgtctttcac 


ctttgcccct 


cacaaaaaag 


tgctgaccaa 


aaacctcagc 


1740 


gcgggctctg 


ggaaatcgca 


agactttacc 


agggaccacg 


tcccgagggg 


tgtcagaaag 


1800 


gaaagccagc 


ttgccggccg 


aatcgtgcag 


gaaaatgggt 


gtgaaaccca 


caaccaaaca 


1860 


gcccgcggct 


tctgcctgag 


accccacgcc 


ctctcggtgg 


atgatgtgtt 


ccagggagct 


1920 


gactgggaga 


ggcctggaag 


cccaccctct 


tatgaagagg 


ccatgcaggg 


cccggcagcc 


1980 


agactagtgg 


cctccgagag 


ccagaccgtg 


gggagcatga 


cggtggggag 


catgagggcg 


2040 


aggatgctgg 


aggcgcactg 


cctcctaccc 


cctcttccac 


ctgctcacca 


cgtagaggac 


2100 


tcaagacaca 


ggggcagcaa 


agagccactc 


cctggccacg 


gactctctcc 


cctgcctgag 


2160 


cgatggaaac 


agagcagaac 


tgtccatgct 


tctggggact 


ctctggggca 


cgtgtctggc 


2220 


ccagggagac 


ctgagctcct 


cccgctgagg 


accgtctccg 


agtccgtgca 


gaggaataag 


2280 


cgggactgtc 


tcgtgcgacg 


atgtagccag 


ccggtctttg 


aggctgacca 


attccaatat 


2340 


gccaaagaat 


cgtatattta 


ggagggaggc 


catacgccat 


gccatagctt 


gtgctatctg 


2400 


taaatatgag 


acttgtaaag 


aactgcctgt 


agattgtttt 


taaaaggtct 


tgaataagct 


2460 


ccttgagaaa 


gttgtggaaa 


gccctcctca 


gtgaggatag 


ctacaccatg 


gccatggcgc 


2520 


atcagatagt 


ctctgtgtac 


ctggatttgt 


gcaatatgta 


aaaatgtatc 


aaatgtatta 


2580 


tagataaggt 


gttaggtgca 


aaggatgtct 


aataatccct 


gcacacgttt 


tgaacttgca 


2640 
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gtgaagtaca ctgctgttcc ttgcttcctg gggcactttt ctcttggtta gtgtttaaaa 2 70 0 

attatcttcg cttttttaat gtggcctcaa atgtcatgcc aattttcaca tcttccacaa 2 76 0 

actccattta gggagaaatg tttaaatctc tggtataagt ttactccata ccagagtaaa 2 82 0 

ctatatatta ctctatataa gcagtcttgc aataactaat caccaccata gaagaaagaa 2 88 0 

acagactgca aggaacagag ttgagtgtct ggagtcatca aaggcattaa aaactccagt 2940 

aaaagctggg gccgtagcaa aaatcatgaa aaacacttca acgtgtcctt tcaatcatcc 3000 

aattaaatgt gggtagatta atgaaaatgt attacatcaa tattaactca t 3 051 

<210> SEQ ID NO 3 

<211> LENGTH: 731 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 00> SEQUENCE: 3 

Met Lys Leu Arg Ser Ser His Asn Ala Ser Lys Thr Leu Asn Ala Asn 
15 10 15 

Asn Met Glu Thr Leu lie Glu Cys Gin Ser Glu Gly Asp lie Lys Glu 
20 25 30 

His Pro Leu Leu Ala Ser Cys Glu Ser Glu Asp Ser lie Cys Gin Leu 
35 40 45 

lie Glu Val Lys Lys Arg Lys Lys Val Leu Ser Trp Pro Phe Leu Met 
50 55 60 

Arg Arg Leu Ser Pro Ala Ser Asp Phe Ser Gly Ala Leu Glu Thr Asp 
65 70 75 80 

Leu Lys Ala Ser Leu Phe Asp Gin Pro Leu Ser lie lie Cys Gly Asp 
85 90 95 

Ser Asp Thr Leu Pro Arg Pro lie Gin Asp lie Leu Thr lie Leu Cys 
100 105 110 

Leu Lys Gly Pro Ser Thr Glu Gly lie Phe Arg Arg Ala Ala Asn Glu 
115 120 125 

Lys Ala Arg Lys Glu Leu Lys Glu Glu Leu Asn Ser Gly Asp Ala Val 
130 135 140 

Asp Leu Glu Arg Leu Pro Val His Leu Leu Ala Val Val Phe Lys Asp 
145 150 155 160 

Phe Leu Arg Ser lie Pro Arg Lys Leu Leu Ser Ser Asp Leu Phe Glu 
165 170 175 

Glu Trp Met Gly Ala Leu Glu Met Gin Asp Glu Glu Asp Arg lie Glu 
180 185 190 

Ala Leu Lys Gin Val Ala Asp Lys Leu Pro Arg Pro Asn Leu Leu Leu 
195 200 205 

Leu Lys His Leu Val Tyr Val Leu His Leu lie Ser Lys Asn Ser Glu 
210 215 220 

Val Asn Arg Met Asp Ser Ser Asn Leu Ala lie Cys lie Gly Pro Asn 
225 230 235 240 

Met Leu Thr Leu Glu Asn Asp Gin Ser Leu Ser Phe Glu Ala Gin Lys 
245 250 255 

Asp Leu Asn Asn Lys Val Lys Thr Leu Val Glu Phe Leu lie Asp Asn 
260 265 270 

Cys Phe Glu lie Phe Gly Glu Asn lie Pro Val His Ser Ser lie Thr 
275 280 285 
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Ser Asp Asp Ser Leu Glu His Thr Asp Ser Ser Asp Val Ser Thr Leu 
290 295 ~ 300 

Gin Asn Asp Ser Ala Tyr Asp Ser Asn Asp Pro Asp Val Glu Ser Asn 
305 310 315 320 

Ser Ser Ser Gly lie Ser Ser Pro Ser Arg Gin Pro Gin Val Pro Met 
325 330 335 

Ala Thr Ala Ala Gly Leu Asp Ser Ala Gly Pro Gin Asp Ala Arg Glu 
340 345 350 

Val Ser Pro Glu Pro lie Val Ser Thr Val Ala Arg Leu Lys Ser Ser 
355 360 365 

Leu Ala Gin Pro Asp Arg Arg Tyr Ser Glu Pro Ser Met Pro Ser Ser 
370 375 380 

Gin Glu Cys Leu Glu Ser Arg Val Thr Asn Gin Thr Leu Thr Lys Ser 
385 390 395 400 

Glu Gly Asp Phe Pro Val Pro Arg Val Gly Ser Arg Leu Glu Ser Glu 
405 410 415 

Glu Ala Glu Asp Pro Phe Pro Glu Glu Val Phe Pro Ala Val Gin Gly 
420 425 430 

Lys Thr Lys Arg Pro Val Asp Leu Lys lie Lys Asn Leu Ala Pro Gly 
435 44 0 4 45 

Ser Val Leu Pro Arg Ala Leu Val Leu Lys Ala Phe Ser Ser Ser Ser 
450 455 460 

Leu Asp Ala Ser Ser Asp Ser Ser Pro Val Ala Ser Pro Ser Ser Pro 
465 470 475 480 

Lys Arg Asn Phe Phe Ser Arg His Gin Ser Phe Thr Thr Lys Thr Glu 
485 490 495 

Lys Gly Lys Pro Ser Arg Glu lie Lys Lys His Ser Met Ser Phe Thr 
500 505 510 

Phe Ala Pro His Lys Lys Val Leu Thr Lys Asn Leu Ser Ala Gly Ser 
515 520 525 

Gly Lys Ser Gin Asp Phe Thr Arg Asp His Val Pro Arg Gly Val Arg 
530 535 540 

Lys Glu Ser Gin Leu Ala Gly Arg lie Val Gin Glu Asn Gly Cys Glu 
545 550 555 560 

Thr His Asn Gin Thr Ala Arg Gly Phe Cys Leu Arg Pro His Ala Leu 
565 570 575 

Ser Val Asp Asp Val Phe Gin Gly Ala Asp Trp Glu Arg Pro Gly Ser 
580 585 590 

Pro Pro Ser Tyr Glu Glu Ala Met Gin Gly Pro Ala Ala Arg Leu Val 
595 600 605 

Ala Ser Glu Ser Gin Thr Val Gly Ser Met Thr Val Gly Ser Met Arg 
610 615 620 

Ala Arg Met Leu Glu Ala His Cys Leu Leu Pro Pro Leu Pro Pro Ala 
625 630 635 640 

His His Val Glu Asp Ser Arg His Arg Gly Ser Lys Glu Pro Leu Pro 
645 650 655 

Gly His Gly Leu Ser Pro Leu Pro Glu Arg Trp Lys Gin Ser Arg Thr 
660 665 670 

Val His Ala Ser Gly Asp Ser Leu Gly His Val Ser Gly Pro Gly Arg 
675 ' 680 6 85 

Pro Glu Leu Leu Pro Leu Arg Thr Val Ser Glu Ser Val Gin Arg Asn 
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690 695 700 

Lys Arg Asp Cys Leu Val Arg Arg Cys Ser Gin Pro Val Phe Glu Ala 
7 05 " ^ 7 10 ~ " 7 15 720 

Asp Gin Phe Gin Tyr Ala Lys Glu Ser Tyr lie 
725 730 



<210> SEQ ID NO 4 

<211> LENGTH: 553 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 4 

Met Gly Ala Leu Glu Met Gin Asp Glu Glu Asp Arg lie Glu Ala Leu 
15 10 15 

Lys Gin Val Ala Asp Lys Leu Pro Arg Pro Asn Leu Leu Leu Leu Lys 
20 25 30 

His Leu Val Tyr Val Leu His Leu lie Ser Lys Asn Ser Glu Val Asn 
35 4 0 45 

Arg Met Asp Ser Ser Asn Leu Ala lie Cys lie Gly Pro Asn Met Leu 
50 55 " 6 0 

Thr Leu Glu Asn Asp Gin Ser Leu Ser Phe Glu Ala Gin Lys Asp Leu 
65 70 75 80 

Asn Asn Lys Val Lys Thr Leu Val Glu Phe Leu lie Asp Asn Cys Phe 

85 90 ~ 95 

Glu lie Phe Gly Glu Asn lie Pro Val His Ser Ser lie Thr Ser Asp 
100 105 110 

Asp Ser Leu Glu His Thr Asp Ser Ser Asp Val Ser Thr Leu Gin Asn 
115 120 125 

Asp Ser Ala Tyr Asp Ser Asn Asp Pro Asp Val Glu Ser Asn Ser Ser 
130 135 140 

Ser Gly lie Ser Ser Pro Ser Arg Gin Pro Gin Val Pro Met Ala Thr 
145 150 155 160 

Ala Ala Gly Leu Asp Ser Ala Gly Pro Gin Asp Ala Arg Glu Val Ser 
165 170 175 

Pro Glu Pro lie Val Ser Thr Val Ala Arg Leu Lys Ser Ser Leu Ala 
180 185 190 

Gin Pro Asp Arg Arg Tyr Ser Glu Pro Ser Met Pro Ser Ser Gin Glu 
195 200 205 

Cys Leu Glu Ser Arg Val Thr Asn Gin Thr Leu Thr Lys Ser Glu Gly 
210 215 220 

Asp Phe Pro Val Pro Arg Val Gly Ser Arg Leu Glu Ser Glu Glu Ala 
225 230 235 240 

Glu Asp Pro Phe Pro Glu Glu Val Phe Pro Ala Val Gin Gly Lys Thr 
245 250 255 

Lys Arg Pro Val Asp Leu Lys lie Lys Asn Leu Ala Pro Gly Ser Val 
260 265 270 

Leu Pro Arg Ala Leu Val Leu Lys Ala Phe Ser Ser Ser Ser Leu Asp 
275 280 285 

Ala Ser Ser Asp Ser Ser Pro Val Ala Ser Pro Ser Ser Pro Lys Arg 
290 295 300 



Asn Phe Phe Ser Arg His Gin Ser Phe Thr Thr Lys Thr Glu Lys Gly 
305 310 315 " 320 
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Lys Pro Ser Arg Glu lie Lys Lys His Ser Met Ser Phe Thr Phe Ala 
325 330 335 

Pro His Lys Lys Val Leu Thr Lys Asn Leu Ser Ala Gly Ser Gly Lys 
340 345 350 

Ser Gin Asp Phe Thr Arg Asp His Val Pro Arg Gly Val Arg Lys Glu 
355 360 365 

Ser Gin Leu Ala Gly Arg lie Val Gin Glu Asn Gly Cys Glu Thr His 
370 375 380 

Asn Gin Thr Ala Arg Gly Phe Cys Leu Arg Pro His Ala Leu Ser Val 
385 390 395 400 

Asp Asp Val Phe Gin Gly Ala Asp Trp Glu Arg Pro Gly Ser Pro Pro 
405 410 415 

Ser Tyr Glu Glu Ala Met Gin Gly Pro Ala Ala Arg Leu Val Ala Ser 
420 425 430 

Glu Ser Gin Thr Val Gly Ser Met Thr Val Gly Ser Met Arg Ala Arg 
435 440 445 

Met Leu Glu Ala His Cys Leu Leu Pro Pro Leu Pro Pro Ala His His 
450 455 460 

Val Glu Asp Ser Arg His Arg Gly Ser Lys Glu Pro Leu Pro Gly His 
465 470 475 480 

Gly Leu Ser Pro Leu Pro Glu Arg Trp Lys Gin Ser Arg Thr Val His 
485 490 495 

Ala Ser Gly Asp Ser Leu Gly His Val Ser Gly Pro Gly Arg Pro Glu 
500 505 510 

Leu Leu Pro Leu Arg Thr Val Ser Glu Ser Val Gin Arg Asn Lys Arg 
515 520 525 

Asp Cys Leu Val Arg Arg Cys Ser Gin Pro Val Phe Glu Ala Asp Gin 
530 535 540 

Phe Gin Tyr Ala Lys Glu Ser Tyr lie 



<210> SEQ ID NO 5 

<211> LENGTH: 3612 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 5 

agaggcaggc ggcttgtgag acgggctcca gagaaaggac ctccctgggt ctctcatttc 60 

ctggctgaag tttctcttct cgctgctgtg gcagcatcca acccacacac acaggacccg 120 

catcctgggt gatgaagtca gacacgcagc agctgggtga gtgctaacgc tcagataagc 18 0 

atctgtgcca ttgtggggac tccctgggct gctctgcacc cggacacttg ctctgtcccc 240 

gccatgtaca acgggtcgtg ctgccgcatc gagggggaca ccatctccca ggtgatgccg 300 

ccgctgctca ttgtggcctt tgtgctgggc gcactaggca atggggtcgc cctgtgtggt 360 

ttctgcttcc acatgaagac ctggaagccc agcactgttt accttttcaa tttggccgtg 420 

gctgatttcc tccttatgat ctgcctgcct tttcggacag actattacct cagacgtaga 480 

cactgggctt ttggggacat tccctgccga gtggggctct tcacgttggc catgaacagg 54 0 

gccgggagca tcgtgttcct tacggtggtg gctgcggaca ggtatttcaa agtggtccac 60 0 

ccccaccacg cggtgaacac tatctccacc cgggtggcgg ctggcatcgt ctgcaccctg 660 

tgggccctgg tcatcctggg aacagtgtat cttttgctgg agaaccatct ctgcgtgcaa 720 
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gagacggccg 


tctcctgtga 


gagcttcatc 


atggagtcgg 


ccaatggctg 


gcatgacatc 


780 


atgttccagc 


tggag-t-tctt 


tatgcccctc 


ggcatcatct 


tattttgctc 


cttcaagatt 


840 


gtttggagcc 


tgaggcggag 


gcagcagctg 


gccagacagg 


ctcggatgaa 


gaaggcgacc 


900 


cggttcatca 


tggtggtggc 


aattgtgttc 


atcacatgct 


acctgcccag 


cgtgtctgct 


960 


agactctatt 


tcctctggac 


ggtgccctcg 


agtgcctgcg 


atccctctgt 


ccatggggcc 


1020 


ctgcacataa 


ccctcagctt 


cacctacatg 


aacagcatgc 


tggatcccct 


ggtgtattat 


1080 


ttttcaagcc 


cctcctttcc 


caaattctac 


aacaagctca 


aaatctgcag 


tctgaaaccc 


1140 


aagcagccag 


gacactcaaa 


aacacaaagg 


ccggaagaga 


tgccaatttc 


gaacctcggt 


1200 


cgcaggagtt 


gcatcagtgt 


ggcaaatagt 


ttccaaagcc 


agtctgatgg 


gcaatgggat 


1260 


ccccacattg 


ttgagtggca 


ctgaacaagc 


agaccaacaa 


cactgaggaa 


gatagagtgg 


1320 


tgacttagaa 


ttaactcgtg 


ctaaggggtc 


gggggctttg 


aaaatgccac 


ccccctttct 


1380 


tattgcaaga 


cggcttctcg 


cacatgaact 


gcatccttct 


cattctgtcg 


gaaatgaaat 


1440 


tcacacaact 


ataccttttg 


gggaggttcc 


agttgattga 


agtgagttgg 


ctgcattttc 


1500 


ttatctgatc 


acaatggcag 


gggacagaat 


gtgcatggag 


tggagcatgt 


gtgtgttggg 


1560 


aggggggcta 


ggaactgcac 


agcccttgtg 


taattttcgt 


tgtttgtttt 


tgttttgaga 


1620 


cagagtctca 


ctctgtgtcc 


caggctggag 


tgcagtggca 


cagtctcggc 


tcactgcaac 


1680 


ctctgcctcc 


cgggttcaag 


caattctcct 


gcctcagcct 


cccgagtagc 


tgggattaga 


1740 


ggcgccagcc 


aacacacccg 


gctaattttt 


gtatttttag 


tagagacagg 


gttttgccat 


1800 


gttggccagg 


ctggtctcga 


gctcctgacc 


tcaggtgatc 


cgcctgcctt 


ggcctcccaa 


1860 


agtggtggga 


tcacaggcgt 


gagccaccgt 


gcccggcctc 


ccctgtgtca 


ttttaaatgg 


1920 


ctaagtaaat 


gggtatatgt 


gtttgaatgg 


ggcatgttca 


ctctcttagg 


ggctatgggg 


1980 


cagttagcag 


catttcctat 


cctctgacct 


taaatcattc 


cttatctcag 


aaaacagaaa 


2040 


ccgggctcag 


tcaatcaatg 


ctttatttca 


ggccgaatga 


ggctctttag 


attgggatct 


2100 


attgatctat 


caattttcat 


ctttacattt 


ctttgtacat 


ctgtacattt 


tgtccaaatg 


2160 


tacatctgta 


cgtctgtcat 


cattgtgact 


tcctggtagc 


ccaagaagaa 


caacaacaaa 


2220 


acaatctgct 


ctgaccttct 


tcaaatcttt 


gtatttcaaa 


gaaggtgctg 


agggatctgt 


2280 


ttccttgccc 


tggcttctcc 


agtgggatgt 


gctgagtcca 


atacaattgc 


ttttataatt 


2340 


gcttttgaca 


acttgtcatg 


tgactgtgaa 


ttgaaattat 


tcacttattt 


tccaagtatt 


2400 


tactgaattc 


gtatttggtg 


gcaggcagta 


tactgtgtaa 


tttttagtgg 


agggtcatta 


2460 


gtcaactctt 


atgtgacagt 


aaagtttttt 


gggggggtgg 


ggacagagaa 


gttaagagct 


2520 


ttcatccttt 


cacggaatac 


agtttctaga 


ccgattctgt 


gtgaacatca 


gttttgtcct 


2580 


cttattgcaa 


gactccctca 


tacacatgag 


tttcccaaat 


gtgtacctgg 


acccctcgaa 


2640 


acagaggact 


ctacgaaatg 


acaggctgcc 


cctgccctga 


attaggggga 


aacattccag 


2700 


gccaactcta 


gctcctttct 


caagctacaa 


agtggtgaac 


atggttctca 


actccttaat 


2760 


ttatactctc 


tcaaatgccc 


aggatactct 


acccacttaa 


gaaccttgcc 


aacttctggg 


2820 


ggttgggcat 


ggtggctcgc 


gcttgtgatc 


ccagcacttt 


gggagactga 


ggcggatcac 


2880 


ctgaggtcag 


gagttctaga 


ccagcctgac 


cgacatggag 


aaacctcgtc 


tctactgaaa 


2940 


attcaaaatt 


agcctggtgt 


ggtggcgcat 


gcctatagtc 


tcagcctcca 


gagtagctgg 


3000 
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gactgcgggc 


gccccaccac 


cacgcccggc 


taattttttg 


tatttttagt 


acagacgggg 


3060 


tttcattgtg 


ttagccggga 


tggtcttgat 


ctcctgactt 


gtgatccgcc 


tgcctcggcc 


3120 


tcccaaagtg 


cttggattac 


aggtgtaagc 


caccgcaccc 


cgcccagcct 


ggcagatttt 


3180 


atttaatcat 


ttgtagcttc 


attttcctcg 


tctgtcaaac 


agggatactg 


taatacaacc 


3240 


tcagtgtgtc 


attgggcagt 


ttaaatgaat 


gtacattcct 


gaggcatcag 


aactttgttc 


3300 


actgttatat 


acccaatgcc 


tagaagagga 


cctgcacata 


gcaggtgctc 


agtaaatgtt 


3360 


tgttgaatga 


atgattaagt 


gcatgtaaag 


cattaagcat 


agcgcctggc 


agtaagtgct 


3420 


caatattatg 


acttcttata 


ttaacacgtt 


ttacatataa 


agaaatggag 


gcaagaaagc 


3480 


atttcctttg 


gggtttagag 


cgcttaagtt 


gt.-tcct.ctgt 


tatcatgcct 


gaattccccc 


3540 


gcccctcagt 


tacctgggga 


agagtaaagg 


caagaattct 


taccagcatt 


agtcatacat 


3600 


cctcctgata 


gg 










3612 


<210> SEQ ID NO 6 

<211> LENGTH: 2345 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 










<4 00> SEQUENCE: 6 












agaggcaggc 


ggcttgtgag 


acgggctcca 


gagaaaggac 


ctccctgggt 


ctctcatttc 


60 


ctggctgaag 


tttctcttct 


cgctgctgtg 


gcagcatcca 


acccacacac 


acaggacccg 


120 


catcctgggt 


gatgaagtca 


gacacgcagc 


agctgggtga 


gtgctaacgc 


tcagataagc 


180 


atctgtgcca 


ttgtggggac 


tccctgggct 


gctctgcacc 


cggacacttg 


ctctgtcccc 


240 


gccatgtaca 


acgggtcgtg 


ctgccgcatc 


gagggggaca 


ccatctccca 


ggtgatgccg 


300 


ccgctgctca 


ttgtggcctt 


tgtgctgggc 


gcactaggca 


atggggtcgc 


cctgtgtggt 


360 


ttctgcttcc 


acatgaagac 


ctggaagccc 


agcactgttt 


accttttcaa 


tttggccgtg 


420 


gctgatttcc 


tccttatgat 


ctgcctgcct 


tttcggacag 


actattacct 


cagacgtaga 


480 


cactgggctt 


ttggggacat 


tccctgccga 


gtggggctct 


tcacgttggc 


catgaacagg 


540 


gccgggagca 


tcgtgttcct 


tacggtggtg 


gctgcggaca 


ggtatttcaa 


agtggtccac 


600 


ccccaccacg 


cggtgaacac 


tatctccacc 


cgggtggcgg 


ctggcatcgt 


ctgcaccctg 


660 


tgggccctgg 


tcatcctggg 


aacagtgtat 


cttttgctgg 


agaaccatct 


ctgcgtgcaa 


720 


gagacggccg 


tctcctgtga 


gagcttcatc 


atggagtcgg 


ccaatggctg 


gcatgacatc 


780 


atgttccagc 


tggagttctt 


tatgcccctc 


ggcatcatct 


tattttgctc 


cttcaagatt 


840 


gtttggagcc 


tgaggcggag 


gcagcagctg 


gccagacagg 


ctcggatgaa 


gaaggcgacc 


900 


cggttcatca 


tggtggtggc 


aattgtgttc 


atcacatgct 


acctgcccag 


cgtgtctgct 


960 


agactctatt 


tcctctggac 


ggtgccctcg 


agtgcctgcg 


atccctctgt 


ccatggggcc 


1020 


ctgcacataa 


ccctcagctt 


cacctacatg 


aacagcatgc 


tggatcccct 


ggtgtattat 


1080 


ttttcaagcc 


cctcctttcc 


caaattctac 


aacaagctca 


aaatctgcag 


tctgaaaccc 


1140 


aagcagccag 


gacactcaaa 


aacacaaagg 


ccggaagaga 


tgccaatttc 


gaacctcggt 


1200 


cgcaggagtt 


gcatcagtgt 


ggcaaatagt 


ttccaaagcc 


agtctgatgg 


gcaatgggat 


1260 


ccccacattg 


ttgagtggca 


ctgaacaagc 


agaccaacaa 


cactgaggaa 


gatagagtgg 


1320 


tgacttagaa 


ttaactcgtg 


ctaaggggtc 


gggggctttg 


aaaatgccac 


ccccctttct 


1380 
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tattgcaaga cggcttctcg cacatgaact gcatccttct cattctgtcg gaaatgaaat 1440 

tcacacaact ataccttttg gggaggttcc agtt.gatt.ga agtgagttgg ctgcattttc 150 0 

ttatctgatc acaatggcag gggacagaat gtgcatggag tggagcatgt gtgtgttggg 156 0 

aggggggcta ggaactgcac agcccttgtg taattttcgt tgtttgtttt tgttttgaga 162 0 

cagagtctca ctctgtgtcc caggctggag tgcagtggca cagtctcggc tcactgcaac 1680 

ctctgcctcc cgggttcaag caattctcct gtctcagcct ccagagtagc tgggactacg 1740 

ggcgccccac caccacgccc ggctaatttt ttgtattttt agtacagacg gggtttcatt 1800 

gtgttagccg ggatggtctt gatctcctga cttgtgatcc gcctgcctcg gcctcccaaa 1860 

gtgcttggat tacaggtgta agccaccgca ccccgcccag cctggcagat tttatttaat 1920 

catttgtagc ttcattttcc tcgtctgtca aacagggata ctgtaataca acctcagtgt 198 0 

gtcattgggc agtttaaatg aatgtacatt cctgaggcat cagaactttg ttcactgtta 2 04 0 

tatacccaat gcctagaaga ggacctgcac atagcaggtg ctcagtaaat gtttgttgaa 210 0 

tgaatgatta agtgcatgta aagcattaag catagcgcct ggcagtaagt gctcaatatt 216 0 

atgacttctt atattaacac gttttacata taaagaaatg gaggcaagaa agcatttcct 2220 

ttggggttta gagcgcttaa gttgttcctc tgttatcatg cctgaattcc cccgcccctc 2280 

agttacctgg ggaagagtaa aggcaagaat tcttaccagc attagtcata catcctcctg 2 34 0 
atagg 2345 

<210> SEQ ID NO 7 

<211> LENGTH: 346 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 00> SEQUENCE: 7 

Met Tyr Asn Gly Ser Cys Cys Arg lie Glu Gly Asp Thr lie Ser Gin 
15 10 15 

Val Met Pro Pro Leu Leu lie Val Ala Phe Val Leu Gly Ala Leu Gly 
20 25 30 

Asn Gly Val Ala Leu Cys Gly Phe Cys Phe His Met Lys Thr Trp Lys 
35 40 45 

Pro Ser Thr Val Tyr Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu 

50 55 60 

Met lie Cys Leu Pro Phe Arg Thr Asp Tyr Tyr Leu Arg Arg Arg His 
65 70 75 " ~ 80 

Trp Ala Phe Gly Asp lie Pro Cys Arg Val Gly Leu Phe Thr Leu Ala 
85 90 95 

Met Asn Arg Ala Gly Ser lie Val Phe Leu Thr Val Val Ala Ala Asp 
100 105 110 

Arg Tyr Phe Lys Val Val His Pro His His Ala Val Asn Thr lie Ser 
115 120 125 

Thr Arg Val Ala Ala Gly lie Val Cys Thr Leu Trp Ala Leu Val lie 

130 135 140 

Leu Gly Thr Val Tyr Leu Leu Leu Glu Asn His Leu Cys Val Gin Glu 
145 150 155 160 

Thr Ala Val Ser Cys Glu Ser Phe lie Met Glu Ser Ala Asn Gly Trp 
165 170 175 
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His Asp lie Met Phe Gin Leu Glu Phe Phe Met Pro Leu Gly lie lie 
180 185 190 

Leu Phe Cys Ser Phe Lys lie Val Trp Ser Leu Arg Arg Arg Gin Gin 
195 200 205 

Leu Ala Arg Gin Ala Arg Met Lys Lys Ala Thr Arg Phe lie Met Val 
210 215 220 

Val Ala lie Val Phe lie Thr Cys Tyr Leu Pro Ser Val Ser Ala Arg 
225 230 235 240 

Leu Tyr Phe Leu Trp Thr Val Pro Ser Ser Ala Cys Asp Pro Ser Val 
245 250 255 

His Gly Ala Leu His lie Thr Leu Ser Phe Thr Tyr Met Asn Ser Met 
260 265 270 

Leu Asp Pro Leu Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Lys Phe 

275 280 285 

Tyr Asn Lys Leu Lys lie Cys Ser Leu Lys Pro Lys Gin Pro Gly His 
290 295 300 

Ser Lys Thr Gin Arg Pro Glu Glu Met Pro lie Ser Asn Leu Gly Arg 
305 310 315 320 

Arg Ser Cys lie Ser Val Ala Asn Ser Phe Gin Ser Gin Ser Asp Gly 
325 330 335 

Gin Trp Asp Pro His lie Val Glu Trp His 



<210> SEQ ID NO 8 
<211> LENGTH: 374 8 
<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 
<4 00> SEQUENCE: 8 

ggctgaggcg gaggagccgc cgcttctgac ctcgctctgg ctccggtgcg cgcggctgag 60 

cgcgtgcgag gccccgcgcg tcgggcaggg gcggcggcgg ccactgcgcg ccgccctgag 120 

gagcgcccca gcggcggcgc gactgcggct gaggagagag ccggctccgg gcctccgcgt 180 

cctcctgctc ccccggcccc ccgcctcctc gggggggcgg cggcggcgat gttctcggtc 240 

ctctcgtacg ggcggctggt ggcccgcgcc gtgctcggcg gcctctcgca gaccgacccc 300 

agggccggcg gcggcggcgg cggcgactac ggactggtga cggccggctg cggcttcggg 360 

aaggacttcc gtaagggcct cctcaagaag ggcgcgtgct acggggacga cgcgtgcttc 420 

gtggcccggc accgttccgc ggacgtgctc ggggttgcag atggtgtagg aggctggaga 480 

gactatggag ttgatccatc tcaattctca gggactttaa tgcggacgtg tgaacgttta 54 0 

gtaaaagaag gacggttcgt acctagtaat cccattggaa ttctcaccac aagctactgt 60 0 

gagttgctgc aaaataaagt ccctttgctc ggtagcagca ccgcctgcat tgtggtgctg 66 0 

gacagaacca gccaccgctt acacacagca aacctgggcg attcaggctt cctggttgtc 720 

a gggg"tgg"tg aagtcgtgca ccgatcagat gagcagcagc attacttcaa cactccattc 780 

cagctctcaa tcgctccccc tgaagccgag ggagtcgtct tgagcgacag tccggatgct 840 

gctgatagca cgtctttcga tgtccagcta ggagacatta tcctgacggc aacagatgga 900 

ctctttgaca acatgcctga ttatatgatt cttcaggagc taaaaaagtt aaagaattca 960 

aattatgaga gtatacaaca gactgccaga agcattgctg agcaagctca tgagctggcc 1020 

tatgacccaa attatatgtc accttttgca cagtttgcat gtgacaatgg attgaatgtg 108 0 
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agaggtggaa 


agccagatga 


catcaccgtc 


cttctttcaa 


tagtggctga 


gtatacagac 


1140 


tagctgaggt 


gtcaagtcct 


gcctttcctt 


tcatcatccc 


aaatttcccc 


tgccatgtgt 


1200 


gctgatcctg 


ctggcaggac 


cacatttctt 


tgccactgat 


ctcaatggcc 


agtgatgtaa 


1260 


gtcttttgcc 


tgtcttcttg 


agactcgttg 


agatctttgt 


tgagaaccac 


tactatcatt 


1320 


cactagctca 


tatctgccgg 


cagcaattga 


agagatccaa 


tatttgaaga 


ttggccttca 


1380 


tttctcgatg 


ttctttccat 


gatggggatg 


gaggtgttca 


gtgccaccgt 


ggctgttact 


1440 


tttcaaagta 


gttgaagtat 


tgaaaatgag 


taatgttggt 


aaagtgaatt 


caaaatccta 


1500 


gtatgctaaa 


gggatggtac 


aagtctaaca 


caaattgtac 


gtaatgatac 


atctactaga 


1560 


aacatacatt 


attcatcaaa 


agaaatgtta 


catgtgtact 


ccacaggcat 


agtctttgtt 


1620 


atgatgattg 


gtgtggcttt 


atgtctttgt 


tataaactcc 


tatttttcag 


gggcttatga 


1680 


ttctgctcta 


aaacattgct 


ctgggttata 


cagttttgat 


cccaaaagct 


tttttgttac 


1740 


aaatcgggag 


aaaaatccat 


tttagttcta 


tggatggaaa 


tatttcatgc 


ttttaaaaag 


1800 


atgtttgtgt 


tcctgtggtt 


aaagttttgg 


cagtttattg 


attagtccaa 


atcacaggct 


1860 


aaggcctgat 


ctccaggagg 


ggtaggggag 


acactttacc 


agtatttttt 


tatggaaata 


1920 


atactcaagg 


ttgtaaaacc 


cctcaaagcc 


tagaaattta 


attgttatgg 


ctgaaattcc 


1980 


tcctagttgt 


ctgatagaat 


gcccctgaat 


gggaactcta 


ggtcccaagg 


cctgaagggt 


2040 


tgagaacaga 


cagctgtaac 


tttgaatttt 


gttggctttc 


agtggtcatg 


ctacctaccc 


2100 


atactcgtac 


tctcagacct 


tttattagta 


gccttgcttt 


ctatagagca 


tgcaccaaat 


2160 


ccagtgagtc 


catgtggaga 


gagcactgtg 


tgcgcagcgg 


cagcagcaca 


gacgtccatg 


2220 


aggaaaactc 


ccagtgatga 


tctgacattt 


acaattaccc 


cacatggaaa 


tttaggggtt 


2280 


tctgaatcaa 


gcttaatgtt 


tacagtttcc 


aaatagccat 


tttgcagtgt 


atagtttcct 


2340 


tacaaaacta 


ccccgcattc 


agttttcaca 


ttatctgcaa 


gctgaaactt 


atttttaagt 


2400 


tttgtgtaca 


agttgactgc 


tgtaaagata 


tatatttttg 


ggtcagtttt 


tttccttcat 


2460 


taacttggtg 


gtagaaaaaa 


atatatactt 


agaaatcctt 


aaattaaagc 


catgttttat 


2520 


atataagtca 


ggtaacattg 


gtgtatagat 


gagaatgcaa 


ttaaacctga 


tgagaatcta 


2580 


cttgagaata 


tagaaagtct 


ttctctaaag 


gagatactga 


ctccctggtt 


tattgcatta 


2640 


aaatttatgt 


ttgaggttac 


ctcaacttgt 


tttaaaagat 


tttgttttgt 


gaatttgtac 


2700 


tgtatatttg 


agtaactgtc 


aggcttttat 


ttaaaattgt 


ttaacatgta 


ccatgtacat 


2760 


gtcattacta 


tatttcaatg 


catcatgctt 


gtaacaggca 


tttcatttat 


aataagaatg 


2820 


agttattcat 


ttgtaagccg 


ttcagtaatt 


tatctactat 


tcctaaattg 


gcataatgtt 


2880 


agataatcta 


ttttgaatca 


cctttaatta 


catgtcagaa 


tgccttaact 


accctaactt 


2940 


gacaaaacag 


aattctttgg 


tagacgcggt 


gggggcgggg 


tggggggtct 


ggacggagtc 


3000 


tctatttaag 


gagaaatcat 


catgctatga 


taaaacacag 


aagcatgagt 


ggcaagtggc 


3060 


ggggtattta 


ttttgcacaa 


actatttgca 


gtct.ct.gtgt 


atttaaaaag 


taaagaaagt 


3120 


tgcatccaga 


agggttttgt 


tagaatgaat 


acatttatat 


taggactgac 


aacttcagct 


3180 


cttttgttta 


ggttttcaat 


tatttttggt 


aagagtatgt 


agccttatga 


tctggatata 


3240 


ttttgcattc 


attttccaac 


gcctacattt 


aattcctggt 


aagagcagtg 


ctcgtcaagt 


3300 


ttctggtttt 


tctctgctct 


catttaaccc 


gtcaaacaca 


atctttgtaa 


agctagattg 


3360 
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gtggtgtttt atacaactta tttactcagc ttaccttttt gagaaacgat tgttagaaat 3 42 0 

tgacgatgtg tttgttccag tgatactgaa agtagtgggg gcaagaattg agtttcacag 3 48 0 

tggaattggc tttggatctg gcctatagat tagtgacata aaatattttc tctattttcc 354 0 

cctgttcttt ttgtgttatg cacttaattt tatgactgcc gggggggtca gctggagtgc 3 60 0 

tgcttaacaa gtatctctcc tactctcagt ggtcagaggc tgtgttggac ccatagtaga 3 66 0 

attttccagg tcacagaccc aagcttccat gggttgttac tgtgctgtac cacttggtgg 3 72 0 

gtctgattct gaacctgatg tgtgtgtt 3748 

<210> SEQ ID NO 9 
<211> LENGTH: 304 
<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 
<4 00> SEQUENCE: 9 

Met Phe Ser Val Leu Ser Tyr Gly Arg Leu Val Ala Arg Ala Val Leu 
1 5 ^ 10 15 

Gly Gly Leu Ser Gin Thr Asp Pro Arg Ala Gly Gly Gly Gly Gly Gly 
20 25 30 

Asp Tyr Gly Leu Val Thr Ala Gly Cys Gly Phe Gly Lys Asp Phe Arg 
35 40 45 

Lys Gly Leu Leu Lys Lys Gly Ala Cys Tyr Gly Asp Asp Ala Cys Phe 

50 " J 55 ~ 6 0 

Val Ala Arg His Arg Ser Ala Asp Val Leu Gly Val Ala Asp Gly Val 
65 70 75 80 

Gly Gly Trp Arg Asp Tyr Gly Val Asp Pro Ser Gin Phe Ser Gly Thr 
85 90 95 

Leu Met Arg Thr Cys Glu Arg Leu Val Lys Glu Gly Arg Phe Val Pro 
100 105 110 

Ser Asn Pro lie Gly lie Leu Thr Thr Ser Tyr Cys Glu Leu Leu Gin 
115 120 125 

Asn Lys Val Pro Leu Leu Gly Ser Ser Thr Ala Cys lie Val Val Leu 
130 135 140 

Asp Arg Thr Ser His Arg Leu His Thr Ala Asn Leu Gly Asp Ser Gly 
145 150 155 160 

Phe Leu Val Val Arg Gly Gly Glu Val Val His Arg Ser Asp Glu Gin 
165 170 175 

Gin His Tyr Phe Asn Thr Pro Phe Gin Leu Ser lie Ala Pro Pro Glu 
180 185 190 

Ala Glu Gly Val Val Leu Ser Asp Ser Pro Asp Ala Ala Asp Ser Thr 
195 200 205 

Ser Phe Asp Val Gin Leu Gly Asp lie lie Leu Thr Ala Thr Asp Gly 
210 215 220 

Leu Phe Asp Asn Met Pro Asp Tyr Met lie Leu Gin Glu Leu Lys Lys 
225 " 230 235 240 

Leu Lys Asn Ser Asn Tyr Glu Ser lie Gin Gin Thr Ala Arg Ser lie 
245 250 255 

Ala Glu Gin Ala His Glu Leu Ala Tyr Asp Pro Asn Tyr Met Ser Pro 
260 265 270 

Phe Ala Gin Phe Ala Cys Asp Asn Gly Leu Asn Val Arg Gly Gly Lys 
275 280 285 
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Pro Asp Asp lie Thr Val Leu Leu Ser lie Val Ala Glu Tyr Thr Asp 
29 0 2 95 3 00 

<210> SEQ ID NO 10 

<211> LENGTH: 173 6 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<4 00> SEQUENCE: 10 



aaaatttgct 


gattaaatga 


atgtgggtgt 


gtttgagagg 


gatcctagac 


agccaagcct 


60 


tctggcatga 


aacgctgaga 


agatgggagt 


gtctgctggc 


agagatgaaa 


gtgagcaggg 


120 


gtgagcgcag 


ccactgccca 


acgcaaaccg 


tgaagaagct 


tctggaagag 


cagaggcgcc 


180 


gccagcagca 


gcagcccgac 


gctggcgggg 


tgcagggaca 


atttctccct 


cccccagagc 


240 


agcccctgac 


cccatctgtg 


aatgaggctg 


tgactggcca 


ccctcccttc 


ccagcacact 


300 


cggagactgt 


gggttctgga 


cctagcagcc 


tgggctttcc 


agactgggac 


cccaacacgc 


360 


atgctgccta 


cactgacagc 


ccctactctt 


gccctgcttc 


tgctgccgaa 


aatttcctgc 


420 


ctcctgactt 


ctacccaccc 


tcggacccag 


ggcagccgtg 


cccatttccc 


cagggcatgg 


480 


aqqctqqacc 

ZJ ZJ ZJ ZJ 


ctqqaqaqtt 

Zj Zj Zj Zj 


tctgcacccc 


cttcaggacc 


cccacagttc 


cccqctqtqq 

3 ZJ ZJ _? 


540 


tccctggacc 


atcgctggag 


gtggcccgag 


ctcacatgct 


ggctttgggg 


ccacagcagc 


600 


tgctggccca 


ggatgaggag 


ggggacacgc 


tccttcacct 


gtttgcggct 


cgggggctgc 


660 


gctgggcggc 


atatgctgcg 


gctgaggtgc 


tccaggtgta 


ccggcgtctt 


gacattcgtg 


720 


agcataaggg 


caagacccct 


ctcctqqtqq 


cqqctqctqc 


caaccagccc 


ctqattqtqq 

3 3 33 


780 


aggatctgtt 


gaacctggga 


gcagagccca 


atgccgctga 


ccatcaggga 


cgttcggtct 


840 


tgcacgtggc 


cgctacctac 


gggctcccag 


gagttctctt 


ggctgtgctt 


aactctgggg 


900 


tccaggttga 


cctggaagcc 


agagacttcg 


agggcctcac 


cccgctccac 


acggccatcc 


960 


tggcccttaa 


cgttgctatg 


cgcccttccg 


acctctgtcc 


ccgggtgctg 


agcacacagg 


1020 


cccgagacag 


gctggattgt 


gtccacatgt 


tgctgcaaat 


gggtgctaat 


cacaccagcc 


1080 


aggagatcaa 


gagcaacaag 


acagttctgc 


acttggccgt 


gcaggctgcc 


aaccccactc 


1140 


tggttcagct 


gctgctggag 


ctgccccggg 


gagacctgcg 


gacctttgtc 


aacatgaagg 


1200 


cccacgggaa 


cacagccctc 


cacatggcgg 


ctgccctgcc 


ccctgggccg 


gcccaggagg 


1260 


ccatcgtgcg 


gcacctgttg 


gcagctgggg 


cggaccccac 


actgcgcaac 


ctggagaatg 


1320 


agcagcccgt 


tcacctgctg 


cggcccgggc 


cgggccctga 


ggggctccgg 


cagctgttga 


1380 


agaggagccg 


tgtggcgccg 


ccaggcctgt 


cctcttagga 


ctcaaaccca 


gaccctggac 


1440 


tgattttcca 


gtccccaccg 


tcctgcggga 


cagccagcgt 


atgctaatgt 


tgcaaaccca 


1500 


tgataatgta 


tgtggaatat 


cctgccattg 


gggttttaca 


ttaaaacccc 


agaatggctg 


1560 


cagaggggtg 


aacaggcccc 


aatatttggg 


gtgctgtgat 


acccctcttc 


tacccacaag 


1620 


gagccctctt 


gatgatttct 


gtgaaatcga 


ggccccttga 


ttgtttctgt 


gaaacaccct 


1680 


gcacccctag 


tcctttcccc 


actgagatct 


ttcgggttct 


ctcccctaac 


tcagct 


1736 



<210> SEQ ID NO 11 

<211> LENGTH: 465 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 
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<4 00> SEQUENCE: 11 

Met Trp Val Cys Leu Arg Gly lie Leu Asp Ser Gin Ala Phe Trp His 
1 5 ~ 10 15 

Glu Thr Leu Arg Arg Trp Glu Cys Leu Leu Ala Glu Met Lys Val Ser 
20 25 30 

Arg Gly Glu Arg Ser His Cys Pro Thr Gin Thr Val Lys Lys Leu Leu 
35 40 45 

Glu Glu Gin Arg Arg Arg Gin Gin Gin Gin Pro Asp Ala Gly Gly Val 
50 55 60 

Gin Gly Gin Phe Leu Pro Pro Pro Glu Gin Pro Leu Thr Pro Ser Val 
65 70 75 80 

Asn Glu Ala Val Thr Gly His Pro Pro Phe Pro Ala His Ser Glu Thr 
85 90 95 

Val Gly Ser Gly Pro Ser Ser Leu Gly Phe Pro Asp Trp Asp Pro Asn 
100 105 110 

Thr His Ala Ala Tyr Thr Asp Ser Pro Tyr Ser Cys Pro Ala Ser Ala 
115 12 0 125 

Ala Glu Asn Phe Leu Pro Pro Asp Phe Tyr Pro Pro Ser Asp Pro Gly 
130 135 140 

Gin Pro Cys Pro Phe Pro Gin Gly Met Glu Ala Gly Pro Trp Arg Val 
145 150 155 160 

Ser Ala Pro Pro Ser Gly Pro Pro Gin Phe Pro Ala Val Val Pro Gly 
165 170 175 

Pro Ser Leu Glu Val Ala Arg Ala His Met Leu Ala Leu Gly Pro Gin 
180 185 190 

Gin Leu Leu Ala Gin Asp Glu Glu Gly Asp Thr Leu Leu His Leu Phe 
195 200 205 

Ala Ala Arg Gly Leu Arg Trp Ala Ala Tyr Ala Ala Ala Glu Val Leu 
210 215 220 

Gin Val Tyr Arg Arg Leu Asp lie Arg Glu His Lys Gly Lys Thr Pro 
225 230 235 240 

Leu Leu Val Ala Ala Ala Ala Asn Gin Pro Leu lie Val Glu Asp Leu 
245 250 255 

Leu Asn Leu Gly Ala Glu Pro Asn Ala Ala Asp His Gin Gly Arg Ser 
260 265 270 

Val Leu His Val Ala Ala Thr Tyr Gly Leu Pro Gly Val Leu Leu Ala 

275 280 285 

Val Leu Asn Ser Gly Val Gin Val Asp Leu Glu Ala Arg Asp Phe Glu 
290 295 300 

Gly Leu Thr Pro Leu His Thr Ala lie Leu Ala Leu Asn Val Ala Met 
305 310 315 320 

Arg Pro Ser Asp Leu Cys Pro Arg Val Leu Ser Thr Gin Ala Arg Asp 
325 330 335 

Arg Leu Asp Cys Val His Met Leu Leu Gin Met Gly Ala Asn His Thr 
340 345 350 

Ser Gin Glu lie Lys Ser Asn Lys Thr Val Leu His Leu Ala Val Gin 
355 360 365 

Ala Ala Asn Pro Thr Leu Val Gin Leu Leu Leu Glu Leu Pro Arg Gly 
370 375 380 

Asp Leu Arg Thr Phe Val Asn Met Lys Ala His Gly Asn Thr Ala Leu 
385 390 395 ~ 400 
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His Met Ala Ala Ala Leu Pro Pro Gly Pro Ala Gin Glu Ala lie Val 
405 410 415 

Arg His Leu Leu Ala Ala Gly Ala Asp Pro Thr Leu Arg Asn Leu Glu 
420 425 430 

Asn Glu Gin Pro Val His Leu Leu Arg Pro Gly Pro Gly Pro Glu Gly 
435 440 445 

Leu Arg Gin Leu Leu Lys Arg Ser Arg Val Ala Pro Pro Gly Leu Ser 
450 455 460 

Ser 
465 

<210> SEQ ID NO 12 

<211> LENGTH: 183 4 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<400> SEQUENCE: 12 



ttcgccggag 


cgcgacccgg 


ggactcccag 


gcctgtgggc 


gggccctgcc 


caggactggg 


60 


cggtgccata 


acccctagtt 


taaaaactcg 


cgggtaccgg 


acccaagatc 


ggggacccgg 


120 


cggcggctcc 


gcgggggaaa 


cagcgaggct 


ggcgcagcgc 


cgaggccgcg 


gccctggggg 


180 


cccgcaatcc 


acgccacgga 


atccccgagt 


gagcaggggt 


gagcgcagcc 


actgcccaac 


240 


gcaaaccgtg 


aagaagcttc 


tggaagagca 


gaggcgccgc 


cagcagcagc 


agcccgacgc 


300 


tggcggggtg 


cagggacaat 


ttctccctcc 


cccagagcag 


cccctgaccc 


catctgtgaa 


36 0 


tgaggctgtg 


actggccacc 


ctcccttccc 


agcacactcg 


gagactgtgg 


gttctggacc 


420 


tagcagcctg 


ggctttccag 


actgggaccc 


caacacgcat 


gctgcctaca 


ctgacagccc 


480 


ctactcttgc 


cctgcttctg 


ctgccgaaaa 


tttcctgcct 


cctgacttct 


acccaccctc 


540 


ggacccaggg 


cagccgtgcc 


catttcccca 


gggcatggag 


gctggaccct 


ggagagtttc 


600 


tgcaccccct 


tcaggacccc 


cacagttccc 


cgctgtggtc 


cctggaccat 


cgctggaggt 


660 


ggcccgagct 


cacatgctgg 


ctttggggcc 


acagcagctg 


ctggcccagg 


atgaggaggg 


720 


ggacacgctc 


cttcacctgt 


ttgcggctcg 


ggggctgcgc 


tgggcggcat 


atgctgcggc 


780 


tgaggtgctc 


caggtgtacc 


ggcgtcttga 


cattcgtgag 


cataagggca 


agacccctct 


840 


cctggtggcg 


gctgctgcca 


accagcccct 


gattgtggag 


gatctgttga 


acctgggagc 


900 


agagcccaat 


gccgctgacc 


atcagggacg 


ttcggtcttg 


cacgtggccg 


ctacctacgg 


960 


gctcccagga 


gttctcttgg 


ctgtgcttaa 


ctctggggtc 


caggttgacc 


tggaagccag 


1020 


agacttcgag 


ggcctcaccc 


cgctccacac 


ggccatcctg 


gcccttaacg 


ttgctatgcg 


1080 


cccttccgac 


ctctgtcccc 


gggtgctgag 


cacacaggcc 


cgagacaggc 


tggattgtgt 


1140 


ccacatgttg 


ctgcaaatgg 


gtgctaatca 


caccagccag 


gagatcaaga 


gcaacaagac 


1200 


agttctgcac 


ttggccgtgc 


aggctgccaa 


ccccactctg 


gttcagctgc 


tgctggagct 


1260 


gccccgggga 


gacctgcgga 


cctttgtcaa 


catgaaggcc 


cacgggaaca 


cagccctcca 


1320 


catggcggct 


gccctgcccc 


ctgggccggc 


ccaggaggcc 


atcgtgcggc 


acctgttggc 


1380 


agctggggcg 


gaccccacac 


tgcgcaacct 


ggagaatgag 


cagcccgttc 


acctgctgcg 


1440 


gcccgggccg 


ggccctgagg 


ggctccggca 


gctgttgaag 


aggagccgtg 


tggcgccgcc 


1500 


aggcctgtcc 


tcttaggact 


caaacccaga 


ccctggactg 


attttccagt 


ccccaccgtc 


1560 
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ctgcgggaca gccagcgtat gctaatgttg caaacccatg ataatgtatg tggaatatcc 1620 

tgccattggg gttttacatt aaaaccccag aatggctgca gaggggtgaa caggccccaa 1680 

tatttggggt gctgtgatac ccctcttcta cccacaagga gccctcttga tgatttctgt 1740 

gaaatcgagg ccccttgatt gtttctgtga aacaccctgc acccctagtc ctttccccac 180 0 

tgagatcttt cgggttctct cccctaactc agct 1834 

<210> SEQ ID NO 13 
<211> LENGTH: 313 
<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 
<4 00> SEQUENCE: 13 

Met Glu Ala Gly Pro Trp Arg Val Ser Ala Pro Pro Ser Gly Pro Pro 
15 10 15 

Gin Phe Pro Ala Val Val Pro Gly Pro Ser Leu Glu Val Ala Arg Ala 
20 25 30 

His Met Leu Ala Leu Gly Pro Gin Gin Leu Leu Ala Gin Asp Glu Glu 
35 40 45 

Gly Asp Thr Leu Leu His Leu Phe Ala Ala Arg Gly Leu Arg Trp Ala 
50 55 60 

Ala Tyr Ala Ala Ala Glu Val Leu Gin Val Tyr Arg Arg Leu Asp lie 
65 70 75 80 

Arg Glu His Lys Gly Lys Thr Pro Leu Leu Val Ala Ala Ala Ala Asn 
8 5 9 0 95 

Gin Pro Leu lie Val Glu Asp Leu Leu Asn Leu Gly Ala Glu Pro Asn 
100 105 110 

Ala Ala Asp His Gin Gly Arg Ser Val Leu His Val Ala Ala Thr Tyr 
115 120 125 

Gly Leu Pro Gly Val Leu Leu Ala Val Leu Asn Ser Gly Val Gin Val 
130 135 140 

Asp Leu Glu Ala Arg Asp Phe Glu Gly Leu Thr Pro Leu His Thr Ala 
145 150 155 160 

lie Leu Ala Leu Asn Val Ala Met Arg Pro Ser Asp Leu Cys Pro Arg 
165 170 * 175 

Val Leu Ser Thr Gin Ala Arg Asp Arg Leu Asp Cys Val His Met Leu 
180 185 " 190 

Leu Gin Met Gly Ala Asn His Thr Ser Gin Glu lie Lys Ser Asn Lys 
195 200 205 

Thr Val Leu His Leu Ala Val Gin Ala Ala Asn Pro Thr Leu Val Gin 
210 215 220 

Leu Leu Leu Glu Leu Pro Arg Gly Asp Leu Arg Thr Phe Val Asn Met 
225 230 235 240 

Lys Ala His Gly Asn Thr Ala Leu His Met Ala Ala Ala Leu Pro Pro 
245 250 255 

Gly Pro Ala Gin Glu Ala lie Val Arg His Leu Leu Ala Ala Gly Ala 
260 265 270 

Asp Pro Thr Leu Arg Asn Leu Glu Asn Glu Gin Pro Val His Leu Leu 
275 280 285 

Arg Pro Gly Pro Gly Pro Glu Gly Leu Arg Gin Leu Leu Lys Arg Ser 
29 0 2 95 3 00 



Arg Val Ala Pro Pro Gly Leu Ser Ser 
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305 310 

<210> SEQ ID NO 14 

<211> LENGTH: 304 9 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<4 00> SEQUENCE: 14 



taacgagctt 


ccttccaccg 


caaaagagct 


ggagaacaat 


gctaggcaac 


gtgctggaga 


60 


ccttggccct 


cggaacccaa 


gtcacgcctc 


ccatgtgagc 


tctggaggga 


gaactttatg 


120 


tgttgcactg 


agggcagtct 


ccggaaacgc 


gattcgcagc 


gggcgccgga 


agcggtgttg 


180 


tgtctgcagc 


tctggcagag 


gactgttcca 


ctagacacgc 


tgaagggact 


gggtacgtgt 


240 


tttccttcag 


gaccagagct 


gagaggagct 


gggatcgcgg 


cggcaatgga 


acgggcctca 


300 


gaaaggcgca 


cggccagcgc 


gctttttgcg 


gggttccggg 


ccttgggact 


tttcagcaac 


360 


gacattccac 


acgtggtgcg 


gttcagcgcg 


ctcaagcgcc 


ggttctatgt 


aacaacctgc 


420 


gtgggcaaga 


gtttccacac 


ctatgacgtt 


cagaaactta 


gtctggttgc 


agtaagtaat 


480 


tctgttccac 


aggatatctg 


ctgtatggca 


gctgatggca 


gattagtctt 


tgctgcttat 


540 


ggaaatgttt 


tctctgcatt 


tgcccgtaat 


aaagagatag 


tacatacctt 


taagggtcat 


600 


aaggcagaaa 


tccatttctt 


gcaacccttt 


ggagaccaca 


ttatctctgt 


tgatactgat 


660 


ggcattctta 


ttatttggca 


catatattca 


gaagaagaat 


acctgcagtt 


gacttttgat 


720 


aaatcagtat 


ttaaaatttc 


tgcaattttg 


catccaagta 


cctacttgaa 


taaaatactt 


780 


ctgggcagtg 


aacaaggaag 


cctgcagttg 


tggaatgtaa 


aatccaataa 


acttctatat 


840 


acatttccag 


gatggaaagt 


tggagtgaca 


gctcttcagc 


aggcaccagc 


cqtqqatqtt 


900 


gttgctattg 


gtcttatgtc 


aggtcaagtt 


atcattcaca 


acattaaatt 


taatgaaaca 


960 


ttaatgaagt 


ttcgtcaaga 


ctggggaccc 


attacttcaa 


tttcatttcg 


cacagatggt 


1020 


catccagtaa 


tqqcaqctqq 


aagcccatgt 


ggccatattg 


qactctqqqa 


tctagaagac 


1080 


aaaaaattaa 


tcaaccaaat 


gagaaatgca 


cactctacag 


caattgccgg 


actgacattt 


1140 


ctccatagag 


agccacttct 


tgtcacaaat 


ggcgctgaca 


atgctcttag 


gatatggata 


120 0 


tttgatggtc 


ctacaggtga 


aggccgactt 


ttgagattca 


gaatgggtca 


tagtgctcct 


1260 


cttaccaata 


tcagatatta 


tggacagaat 


ggacagcaga 


ttctaagtgc 


aagtcaagat 


1320 


ggaactcttc 


agtcattttc 


cacggtacat 


gaaaaattca 


ataagagctt 


gggacatgga 


1380 


ttaataaata 


aaaagagagt 


taaacgtaaa 


ggacttcaga 


ataccatgtc 


agtgagactt 


1440 


ccacccatca 


caaagtttgc 


agcagaggaa 


gctcgtgaaa 


gtgactggga 


tggtatcatt 


1500 


gcttgccatc 


aaggtaagct 


atcttgctca 


acctggaatt 


atcagaaatc 


tacaataggc 


1560 


gcttactttc 


tcaagccaaa 


agagttgaag 


aaagatgaca 


taactgcaac 


agcagtggat 


1620 


ataacttctt 


gtggaaactt 


tgctgtaatt 


ggcctctcat 


caggaactgt 


agatgtatat 


1680 


aacatgcagt 


ctggcataca 


tcgaggaagt 


tttggcaagg 


atcaagctca 


caagggatct 


1740 


gttagaggtg 


tcgcagtgga 


tggattaaac 


cagttgacag 


ttacaactgg 


tagtgaagga 


1800 


ttactcaaat 


tctggaactt 


taaaaacaaa 


attttaatcc 


attctgtgag 


cctcagttca 


1860 


tctccaaata 


tcatgttgct 


acatagggac 


agtggcattc 


tgggactcgc 


cttggatgac 


1920 


ttctccatta 


gtgttctgga 


catagaaact 


aggaagattg 


tcagagagtt 


ttctggacac 


1980 



US 2003/01 19024 Al Jun. 26, 2003 

55 



-continued 



caaggccaaa 


taaatgacat 


ggcttttagt 


cctgatggtc 


gttggttaat 


aaqtqctqcq 

Z) Z? ZJ ZJ 


2040 


atggattgct 


ctattaggac 


ttgggacctt 


ccttctgggt 


gccttataga 


ctgctttttg 


2100 


ttggactcgg 


ctcctctcaa 


tgtttctatg 


tctcctactg 


gagactttct 


ggcaacttcc 


2160 


catgtggacc 


accttggaat 


ttatctatgg 


tccaatattt 


ccctgtattc 


agttgtttca 


2220 


ttacggccac 


ttcctgcaga 


ttatgtccct 


tcaatagtca 


tgcttcctgg 


tacttgtcaa 


2280 


acccaagatg 


tagaagtatc 


agaagaaaca 


gtagaaccaa 


gtgatgaatt 


gatagaatat 


2340 


gattcgccag 


aacagttgaa 


tgagcaattg 


gtgactcttt 


cacttcttcc 


tgaatcacga 


2400 


tggaaaaacc 


ttcttaacct 


tgatgttatt 


aagaaaaaga 


ataaaccaaa 


ggaaccaccc 


2460 


aaagtaccca 


aatcagcacc 


atttttcatt 


ccaacaattc 


ctggccttgt 


acccagatat 


2520 


gctgcacctg 


aacaaaataa 


tgatccccag 


cagtctaaag 


tggtaaatct 


tggagttttg 


258 0 


gctcaaaaat 


cagatttctg 


cttgaaactt 


gaagaaggac 


tggtaaataa 


taagtatgac 


2640 


actgctctca 


accttctgaa 


agaatcaggc 


ccatcaggaa 


ttgaaacaga 


gctgcgaagc 


2700 


ttgtctcctg 


attgtggtgg 


gtccatagaa 


gttatgcaga 


gcttcttgaa 


aatgattggg 


2760 


atgatgctgg 


acagaaagcg 


tgattttgag 


ttagcccagg 


cataccttgc 


attgtttcta 


2820 


aagttacacc 


ttaaaatgct 


tccttcagag 


ccagtactcc 


tagaagaaat 


aacaaatttg 


2880 


tcatcccagg 


tggaagaaaa 


ctggacccat 


ttgcaatcac 


tcttcaatca 


aagcatgtgt 


2940 


attttaaatt 


atctcaaaag 


tgctttgttg 


taaaaataaa 


tttgtgacta 


aacaaagact 


3000 


ttcatattaa 


atgggttcaa 


ttgaactcat 


ttcttatttt 


ccaagtgtc 




3049 



<210> SEQ ID NO 15 

<211> LENGTH: 951 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<400> SEQUENCE: 15 

Met Cys Cys Thr Glu Gly Ser Leu Arg Lys Arg Asp Ser Gin Arg Ala 
15 10 15 

Pro Glu Ala Val Leu Cys Leu Gin Leu Trp Gin Arg Thr Val Pro Leu 
20 25 30 

Asp Thr Leu Lys Gly Leu Gly Thr Cys Phe Pro Ser Gly Pro Glu Leu 
35 4 0 ^ 45 

Arg Gly Ala Gly lie Ala Ala Ala Met Glu Arg Ala Ser Glu Arg Arg 
50 55 60 

Thr Ala Ser Ala Leu Phe Ala Gly Phe Arg Ala Leu Gly Leu Phe Ser 
65 70 75 80 

Asn Asp lie Pro His Val Val Arg Phe Ser Ala Leu Lys Arg Arg Phe 
8 5 9 0 95 

Tyr Val Thr Thr Cys Val Gly Lys Ser Phe His Thr Tyr Asp Val Gin 
100 105 110 

Lys Leu Ser Leu Val Ala Val Ser Asn Ser Val Pro Gin Asp lie Cys 
115 120 125 

Cys Met Ala Ala Asp Gly Arg Leu Val Phe Ala Ala Tyr Gly Asn Val 
130 135 140 

Phe Ser Ala Phe Ala Arg Asn Lys Glu lie Val His Thr Phe Lys Gly 
145 150 155 160 

His Lys Ala Glu lie His Phe Leu Gin Pro Phe Gly Asp His lie lie 
165 170 ' 175 
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Ser Val Asp Thr Asp Gly lie Leu lie lie Trp His lie Tyr Ser Glu 
180 185 ~ 190 

Glu Glu Tyr Leu Gin Leu Thr Phe Asp Lys Ser Val Phe Lys lie Ser 
195 200 205 

Ala lie Leu His Pro Ser Thr Tyr Leu Asn Lys lie Leu Leu Gly Ser 
210 215 220 

Glu Gin Gly Ser Leu Gin Leu Trp Asn Val Lys Ser Asn Lys Leu Leu 
225 230 235 240 

Tyr Thr Phe Pro Gly Trp Lys Val Gly Val Thr Ala Leu Gin Gin Ala 
245 250 255 

Pro Ala Val Asp Val Val Ala lie Gly Leu Met Ser Gly Gin Val lie 
260 265 270 

lie His Asn lie Lys Phe Asn Glu Thr Leu Met Lys Phe Arg Gin Asp 

275 280 285 

Trp Gly Pro lie Thr Ser lie Ser Phe Arg Thr Asp Gly His Pro Val 
290 295 300 

Met Ala Ala Gly Ser Pro Cys Gly His lie Gly Leu Trp Asp Leu Glu 
305 " 310 "* 315 320 

Asp Lys Lys Leu lie Asn Gin Met Arg Asn Ala His Ser Thr Ala lie 
325 330 335 

Ala Gly Leu Thr Phe Leu His Arg Glu Pro Leu Leu Val Thr Asn Gly 
340 345 350 

Ala Asp Asn Ala Leu Arg lie Trp lie Phe Asp Gly Pro Thr Gly Glu 
355 360 365 

Gly Arg Leu Leu Arg Phe Arg Met Gly His Ser Ala Pro Leu Thr Asn 
370 375 380 

lie Arg Tyr Tyr Gly Gin Asn Gly Gin Gin lie Leu Ser Ala Ser Gin 
385 390 395 400 

Asp Gly Thr Leu Gin Ser Phe Ser Thr Val His Glu Lys Phe Asn Lys 
405 410 415 

Ser Leu Gly His Gly Leu lie Asn Lys Lys Arg Val Lys Arg Lys Gly 
420 425 430 

Leu Gin Asn Thr Met Ser Val Arg Leu Pro Pro lie Thr Lys Phe Ala 
435 440 445 

Ala Glu Glu Ala Arg Glu Ser Asp Trp Asp Gly lie lie Ala Cys His 
450 455 460 

Gin Gly Lys Leu Ser Cys Ser Thr Trp Asn Tyr Gin Lys Ser Thr lie 
465 470 475 480 

Gly Ala Tyr Phe Leu Lys Pro Lys Glu Leu Lys Lys Asp Asp lie Thr 
485 ~ 490 495 

Ala Thr Ala Val Asp lie Thr Ser Cys Gly Asn Phe Ala Val lie Gly 
500 505 510 

Leu Ser Ser Gly Thr Val Asp Val Tyr Asn Met Gin Ser Gly lie His 
515 520 525 

Arg Gly Ser Phe Gly Lys Asp Gin Ala His Lys Gly Ser Val Arg Gly 
530 535 540 

Val Ala Val Asp Gly Leu Asn Gin Leu Thr Val Thr Thr Gly Ser Glu 
545 550 555 560 

Gly Leu Leu Lys Phe Trp Asn Phe Lys Asn Lys lie Leu lie His Ser 
5 65 570 575 
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Val Ser Leu Ser Ser Ser Pro Asn lie Met. Leu Leu His Arg Asp Ser 
580 585 590 

Gly lie Leu Gly Leu Ala Leu Asp Asp Phe Ser lie Ser Val Leu Asp 
595 600 605 

lie Glu Thr Arg Lys lie Val Arg Glu Phe Ser Gly His Gin Gly Gin 
610 615 620 

lie Asn Asp Met Ala Phe Ser Pro Asp Gly Arg Trp Leu lie Ser Ala 
625 630 635 ~ 640 

Ala Met Asp Cys Ser lie Arg Thr Trp Asp Leu Pro Ser Gly Cys Leu 
645 650 655 

lie Asp Cys Phe Leu Leu Asp Ser Ala Pro Leu Asn Val Ser Met Ser 
660 665 670 

Pro Thr Gly Asp Phe Leu Ala Thr Ser His Val Asp His Leu Gly lie 
675 680 685 

Tyr Leu Trp Ser Asn lie Ser Leu Tyr Ser Val Val Ser Leu Arg Pro 
690 695 700 

Leu Pro Ala Asp Tyr Val Pro Ser lie Val Met Leu Pro Gly Thr Cys 
705 710 715 720 

Gin Thr Gin Asp Val Glu Val Ser Glu Glu Thr Val Glu Pro Ser Asp 
725 730 735 

Glu Leu lie Glu Tyr Asp Ser Pro Glu Gin Leu Asn Glu Gin Leu Val 
740 745 750 

Thr Leu Ser Leu Leu Pro Glu Ser Arg Trp Lys Asn Leu Leu Asn Leu 
755 760 765 

Asp Val lie Lys Lys Lys Asn Lys Pro Lys Glu Pro Pro Lys Val Pro 
77 0 7 75 7 80 

Lys Ser Ala Pro Phe Phe lie Pro Thr lie Pro Gly Leu Val Pro Arg 
785 790 795 800 

Tyr Ala Ala Pro Glu Gin Asn Asn Asp Pro Gin Gin Ser Lys Val Val 
805 810 815 

Asn Leu Gly Val Leu Ala Gin Lys Ser Asp Phe Cys Leu Lys Leu Glu 
820 825 830 

Glu Gly Leu Val Asn Asn Lys Tyr Asp Thr Ala Leu Asn Leu Leu Lys 
835 ^ 840 845 

Glu Ser Gly Pro Ser Gly lie Glu Thr Glu Leu Arg Ser Leu Ser Pro 
850 ' 855 860 

Asp Cys Gly Gly Ser lie Glu Val Met Gin Ser Phe Leu Lys Met lie 
865 870 875 880 

Gly Met Met Leu Asp Arg Lys Arg Asp Phe Glu Leu Ala Gin Ala Tyr 
885 890 895 

Leu Ala Leu Phe Leu Lys Leu His Leu Lys Met Leu Pro Ser Glu Pro 
900 905 910 

Val Leu Leu Glu Glu lie Thr Asn Leu Ser Ser Gin Val Glu Glu Asn 
915 920 925 

Trp Thr His Leu Gin Ser Leu Phe Asn Gin Ser Met Cys lie Leu Asn 
930 935 940 

Tyr Leu Lys Ser Ala Leu Leu 
945 950 



<210> SEQ ID NO 16 
<211> LENGTH: 4617 
<212> TYPE: DNA 
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<213> ORGANISM: Homo sapiens 
<4 00> SEQUENCE: 16 



agatttaagt 


aagtcttccc 


caacaccgaa 


tgggattcca 


tcttcagacc 


cagccagcga 


60 


tgccatggac 


cccttccatg 


cttgcagtat 


tcttaagcaa 


ctcaaaacaa 


tgtacgatga 


120 


aggacagttg 


acagacattg 


tagtggaagt 


ggatcacggg 

ZJ ZJ ZJ ZJ ZJ 


aaaacatttt 


cctgtcatag 


180 


aaacgttctt 


gctgcaatca 


gcccttactt 


cagatccatg 


ttcactagcg 


gccttacaga 


240 


aagtactcaa 


aaagaagttc 


gaatagttgg 


tgttgaagct 


gaatcgatgg 


atttagtgtt 


300 


gaactatgcc 


tacacttcca 


gagttattct 


tacagaggcc 


aatgttcaag 


ccttgttcac 


360 


tgcagctagc 


atcttccaga 


ttccttccat 


ccaagaccaa 


tgtgctaagt 


atatgatcag 


420 


tcatttggac 


ccacagaatt 


ctattggggt 


ctttatcttt 


gctgatcatt 


atggtcatca 


480 


ggaactcgga 


gatcgatcaa 


aagaatacat 


tcgtaaaaag 


tttctgtgtg 


tcaccaaaga 


540 


acaagagttt 


ctccagttga 


caaaagacca 


actgataagt 


atactagaca 


gtgacgattt 


600 


aaatgtagac 


cgagaagagc 


atgtttatga 


aagcattata 


aggtggtttg 


agcatgaaca 


660 


gaatgaaaga 


gaagtgcacc 


ttccagaaat 


ttttgctaaa 


tgcatacgtt 


ttcctctgat 


720 


ggaagatacc 


tttatagaga 


aaattccacc 


tcagtttgca 


caggctatag 


ccaaaagctg 


780 


tgtagaaaag 


ggaccatcca 


acaccaatgg 


ctgtacacag 


aggcttggaa 


tgactgcttc 


840 


tgaaatgatc 


atatgttttg 


atgctgccca 


caaacactca 


ggaaagaagc 


aaacagtgcc 


900 


ttgtctagat 


atagtcacag 


gaagggtgtt 


taaactatgc 


aaaccaccaa 


atgacctgag 


960 


agaagttggg 


attcttgtat 


caccagataa 


tgacatttac 


attgcaggag 


ggtacaggcc 


1020 


aagcagcagt 


gaggtctcca 


tcgaccataa 


ggcagaaaat 


gatt-tctgga 


tgtatgatca 


1080 


ttccaccaat 


agatggctat 


ccaaaccatc 


cttgcttcga 


gccagaatag 


gctgcaaact 


1140 


tgtctattgc 


tgtggtaaaa 


tgtatgcaat 


cqqaqqtcqt 


gtttatgaag 


qtqatqqqaq 


1200 


aaactcacta 


aaatctgttg 


agtgctacga 


cagtagagag 


aattgttgga 


cgactgtttg 


1260 


cgcgatgcca 


qttqcaatqq 

ZJ ZJ ZJ ZJ 


aatttcataa 


tgctgtggag 

Z) Z) ZJ z} zj 


tacaaagaga 


agatctatgt 


1320 


tttacaggga 


gaattttttc 


tcttctatga 


gcctcaaaaa 


qactactqqq 

3 3 3 3 


gtttcttaac 


1380 


ccccatgact 


gtgcctagaa 


tccagggctt 


agcagctgta 


tacaaggact 


ctatctacta 


1440 


catagctgga 


acctgtggaa 


atcatcaacg 


tatgtttact 


gtagaagcct 


atgatattga 


1500 


gctaaataaa 


tggactcgta 


agaaagactt 


tccatgtgat 


cagtccataa 


atccatacct 


1560 


taaactggta 


cttttccaga 


acaaactcca 


tttatttgtt 


cgagctactc 


aagtgactgt 


1620 


tgaagaacac 


gtcttcagaa 


ccagcagaaa 


aaattccctt 


taccaatatg 


atgacattgc 


1680 


tgaccagtgg 


atgaaagtgt 


atgagacccc 


agatcggctc 


tgggaccttg 


gccggcattt 


1740 


tgaatgtgct 


gttgctaaac 


tgtatcctca 


gtgtcttcag 


aaagtactct 


aaatgagtag 


1800 


caggccttag 


tgcatcactg 


gcatctcatt 


cttaggaaac 


ttgtctttga 


tacaaaagag 


1860 


tgctgacagt 


atttcagaaa 


gctgagagag 


ttttatacat 


ggaaaatggg 


tatgcttaaa 


1920 


gattgcaggg 


tagggaggga 


ttttccttca 


tccttgtgac 


atttcatttc 


agtaaggaaa 


1980 


agataacaaa 


gtgcaattat 


cagcattttt 


ttttcctggc 


ataaaattaa 


tcatttcatt 


2040 


ttataatttt 


gtgataaata 


gtaactgagg 


taccagatga 


atcaggacaa 


ctatgcactc 


2100 


ttataagagc 


atttagggta 


ttattgggta 


aagacgtcta 


aacttgtttg 


atgtgacttt 


2160 
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taattttaaa 


tacgggtaac 


aatctgaggc 


aatatcacta 


ggactttagc 


tgtgacctct 


2220 


ctaacacaga 


gaagcactaa 


cttagatcct 


cattcttaat 


atttatatgt 


atctattttt 


2280 


gtgtactgtt 


ttcaagtgta 


ctgagattta 


aatgtgttct 


attattagag 


tagatcgaag 


2340 


aaaaaattag 


tctcagaaag 


agcttttagt 


ctgattgttt 


ccatttccca 


tgtaatttta 


2400 


agttaagcta 


aagttttaaa 


gtggcagttt 


tctgtcgatg 


actttttcaa 


gtgctaacac 


2460 


tgtctctttt 


gtgaaaatct 


ggaaaagtgc 


tcatattcac 


aggtggctgg 


tgctagtcta 


2520 


acttaattca 


tgtgtataac 


tagatggatt 


taaatggtct 


gagcctatgc 


ctatctttca 


2580 


aattggtgtg 


gatttcatgg 


ccatagtact 


ttacctgttg 


aactcttgtg 


atttcacaag 


2640 


attctctact 


tatgtgatag 


gagggtatgg 


ccagttattc 


atctaactgg 


actcaatctt 


2700 


agaatagtag 


gaacattata 


cccagtttgc 


actaacatgg 


gccatttgta 


gcccaacctt 


2760 


ctcttccatc 


tacctgtcca 


ttcattattg 


gtacaaggaa 


aggtaactta 


tttctcttct 


2820 


gcacagagca 


taatgtgaag 


ttttatacct 


acttttaaaa 


ttctgctttc 


cagaaacaaa 


2880 


attcctgcag 


tggtctaatt 


taatgtcttt 


aagtttcata 


ttacaattaa 


aacctcattt 


2940 


tttttttcca 


tttttgcact 


taacagtgat 


gaatactttt 


acgttggaat 


cctccttcta 


3000 


gctgaaggtg 


attgaaaagg 


aaaagagtga 


gtgaacagaa 


ccatagcttt 


ctaggtacta 


3060 


aagcattttt 


tgcatttaac 


tgatgaaatt 


tctaacaatc 


atcagttagg 


aatattaaca 


3120 


tgaaggataa 


accaacttat 


ttgtatacct 


aaggcaggca 


tttggatcag 


taacatgttt 


3180 


tactaagcct 


agagtaattc 


gtaaagggta 


taagcatagg 


acagattttg 


ccctcaatca 


3240 


caatatttgt 


attcacttga 


aagcaaactg 


gcatggttcg 


tattttaaaa 


atcttgcaca 


3300 


aattgtaatg 


tgatactgtg 


aaacaaattg 


aaaacattgc 


ctctttgcat 


cacatacctc 


3360 


gtttttcaga 


aactttccaa 


actgctttac 


atagacctct 


acaagtaggg 


aatgttttct 


3420 


gaagcagaag 


ttaaaatgga 


cagcatttct 


agaattaaca 


ttttaaaatc 


tagtcttagc 


3480 


tagatatgtg 


gtttcttctt 


attggtgttg 


atagtatgtc 


tgtaatctct 


gtataaactt 


3540 


tgtcaacatt 


tttacctccc 


cagttttatc 


ttctgttttg 


tttttgtttt 


tatcatcatg 


3600 


atgttttgga 


gttattactg 


tgtattttag 


aaatcattct 


ttacagtttt 


gcattgctga 


3660 


ggagagagaa 


aaaacaattt 


ttttgcaaga 


gatgttcatg 


taatttattt 


ttgaaagctt 


3720 


tgttgaataa 


gatttcctgc 


cgctttttga 


caatcttgtg 


tatttagaaa 


aatgtattac 


3780 


ttgaaaacat 


gacatagaac 


attgagttag 


caatttacat 


gggctgtatg 


ttatataaga 


3840 


gaatgacata 


ctgtggctaa 


ttcaacagta 


gatttattct 


tttagcctgc 


acaacagttg 


3900 


atcttttggc 


tatgacaatt 


tgtatggagg 


gtacgatcta 


agttaagtgt 


gtcaaaagca 


3960 


aggcttagga 


tttgttatgg 


gagtagaata 


tatattgaat 


tttgtatgaa 


gaactatttg 


4020 


tttaaattat 


atagctggga 


tattttgcca 


ctgttaaaat 


ggattcagaa 


gaggtcctag 


4080 


aaaagtaaga 


ttagtgacat 


gtgtgggttt 


atatttagat 


atttaaggtg 


cattttcata 


4140 


gtgtggtaag 


accttaagta 


aaaggcacaa 


tgggtactac 


agaattaaaa 


tgtaggtcta 


4200 


acataatgcc 


agttccactt 


taactttgtt 


tttgcatttg 


aagaatgtat 


gtagcacttt 


4260 


cctatatatt 


tgtcacacat 


tgaaaactgg 


actgggtata 


actatgttat 


aggaaagtag 


4320 


aaattgtatt 


ctttattttc 


catctttgtt 


ttctgttcta 


caaagttgat 


gcttaagcat 


4380 


caagctgatt 


ttattggtca 


tgagaacaaa 


tggatgtgat 


catgaaggaa 


tcagattccc 


4440 
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tatgtaaagc agtttaaaat ggaattcaat gttcagtgct caggtatgta gtaagtactg 450 0 
tagtcctgtg ggggcaaatg tgtagatatt tttaaacatt ttgccataat tgcacaattt 456 0 
tttgcatttt tacctgatgt cattgtttct tataataaaa ccttttctga ttgaaaa 4617 

<210> SEQ ID NO 17 

<211> LENGTH: 575 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 17 

Met Asp Pro Phe His Ala Cys Ser lie Leu Lys Gin Leu Lys Thr Met 
15 10 15 

Tyr Asp Glu Gly Gin Leu Thr Asp lie Val Val Glu Val Asp His Gly 
20 25 30 

Lys Thr Phe Ser Cys His Arg Asn Val Leu Ala Ala lie Ser Pro Tyr 
35 40 45 

Phe Arg Ser Met Phe Thr Ser Gly Leu Thr Glu Ser Thr Gin Lys Glu 
50 55 60 

Val Arg lie Val Gly Val Glu Ala Glu Ser Met Asp Leu Val Leu Asn 
65 70 75 80 

Tyr Ala Tyr Thr Ser Arg Val lie Leu Thr Glu Ala Asn Val Gin Ala 
85 90 95 

Leu Phe Thr Ala Ala Ser lie Phe Gin lie Pro Ser lie Gin Asp Gin 
100 105 110 

Cys Ala Lys Tyr Met lie Ser His Leu Asp Pro Gin Asn Ser lie Gly 
115 120 125 

Val Phe lie Phe Ala Asp His Tyr Gly His Gin Glu Leu Gly Asp Arg 
130 135 140 

Ser Lys Glu Tyr lie Arg Lys Lys Phe Leu Cys Val Thr Lys Glu Gin 
145 150 155 160 

Glu Phe Leu Gin Leu Thr Lys Asp Gin Leu lie Ser lie Leu Asp Ser 
165 170 175 

Asp Asp Leu Asn Val Asp Arg Glu Glu His Val Tyr Glu Ser lie lie 
180 185 190 

Arg Trp Phe Glu His Glu Gin Asn Glu Arg Glu Val His Leu Pro Glu 
195 200 205 

lie Phe Ala Lys Cys lie Arg Phe Pro Leu Met Glu Asp Thr Phe lie 
210 215 220 

Glu Lys lie Pro Pro Gin Phe Ala Gin Ala lie Ala Lys Ser Cys Val 
225 230 235 240 

Glu Lys Gly Pro Ser Asn Thr Asn Gly Cys Thr Gin Arg Leu Gly Met 
245 250 255 

Thr Ala Ser Glu Met lie lie Cys Phe Asp Ala Ala His Lys His Ser 
260 265 270 

Gly Lys Lys Gin Thr Val Pro Cys Leu Asp lie Val Thr Gly Arg Val 
275 280 285 

Phe Lys Leu Cys Lys Pro Pro Asn Asp Leu Arg Glu Val Gly lie Leu 
290 295 300 

Val Ser Pro Asp Asn Asp lie Tyr lie Ala Gly Gly Tyr Arg Pro Ser 
305 310 315 320 

Ser Ser Glu Val Ser lie Asp His Lys Ala Glu Asn Asp Phe Trp Met 
325 330 335 
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Tyr Asp His Ser Thr Asn Arg Trp Leu Ser Lys Pro Ser Leu Leu Arg 
340 345 35 0 

Ala Arg lie Gly Cys Lys Leu Val Tyr Cys Cys Gly Lys Met Tyr Ala 
355 360 365 

lie Gly Gly Arg Val Tyr Glu Gly Asp Gly Arg Asn Ser Leu Lys Ser 
370 375 380 

Val Glu Cys Tyr Asp Ser Arg Glu Asn Cys Trp Thr Thr Val Cys Ala 
385 390 395 400 

Met Pro Val Ala Met Glu Phe His Asn Ala Val Glu Tyr Lys Glu Lys 
405 410 415 

lie Tyr Val Leu Gin Gly Glu Phe Phe Leu Phe Tyr Glu Pro Gin Lys 
420 425 430 

Asp Tyr Trp Gly Phe Leu Thr Pro Met Thr Val Pro Arg lie Gin Gly 
435 440 445 

Leu Ala Ala Val Tyr Lys Asp Ser lie Tyr Tyr lie Ala Gly Thr Cys 
45 0 ~ 455 " 4 60 

Gly Asn His Gin Arg Met Phe Thr Val Glu Ala Tyr Asp lie Glu Leu 
465 470 475 ^ " 480 

Asn Lys Trp Thr Arg Lys Lys Asp Phe Pro Cys Asp Gin Ser lie Asn 
485 490 495 

Pro Tyr Leu Lys Leu Val Leu Phe Gin Asn Lys Leu His Leu Phe Val 
500 505 510 

Arg Ala Thr Gin Val Thr Val Glu Glu His Val Phe Arg Thr Ser Arg 
515 520 525 

Lys Asn Ser Leu Tyr Gin Tyr Asp Asp lie Ala Asp Gin Trp Met Lys 
530 535 540 

Val Tyr Glu Thr Pro Asp Arg Leu Trp Asp Leu Gly Arg His Phe Glu 
545 550 555 560 

Cys Ala Val Ala Lys Leu Tyr Pro Gin Cys Leu Gin Lys Val Leu 



<210> SEQ ID NO 18 

<211> LENGTH: 358 8 

<212> TYPE: DNA 

<213> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 18 

ctggagactg gaaggtccaa gatcaagata ctacagattt gatttctgga cgttgaacat 6 0 

ggtgtaggag tagaaaagca acagggacgg aaggagagaa cttacccctt caagcccttt 120 

tataaggcac taaatcccat cattgagggc agagtcctca tagcctaatc acctcctaaa 180 

tgctccattt cttaatattg ttgcactgag gattaagctt caacatgaat tctgaagagg 24 0 

acacaaacat ccaaaccata gcagtcaatg ccttagccct tgatgttgct atcaacctga 300 

gattcgggga tcaaggaagg acaggtaata gttaacctct tctgtgagaa gtcagaaggt 360 

gatctcttta atgctttctt tttaagaatt tttcaaattg agactaattg cagaggttcc 42 0 

agttgaccag cattcatagg aatgaagaca aacacagaga tggtgtgtct aagaaacttc 48 0 

aaaaggtgta gacctcctga ctgaagcata ttggatttat ttaatttttt tcactgtatt 54 0 

tctgtcctcc tacaagggaa agtcatgatt acactaactg agctaaaatg cttagcagat 60 0 

gcccagtcat cttatcacat cttaaaacca tggtgggacg tcttctggta ttacatcaca 660 
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ctgatcatgc 


tgctggtggc 


cgtgctggcc 


ggagct.ct.cc 


agctgacgca 


gagcagggtt 


720 


ctgtgctgtc 


ttccatgcaa 


agtggaattt 


gacaatcact 


gtgccgtgcc 


ttgggacatc 


780 


ctgaaagcca 


gcatgaacac 


atcctctaat 


cctgggacac 


cgcttccgct 


ccccctccga 


840 


attcagaatg 


acctccaccg 


acagcagtac 


tcctatattg 


atgccgtctg 


ttacgagaaa 


900 


cagctccatt 


ggtttgcaaa 


gtttttcccc 


tatctggtgc 


tcttgcacac 


gctcatcttt 


960 


gcagcctgca 


gcaacttttg 


gcttcactac 


cccagtacca 


gttccaggct 


cgagcatttt 


1020 


gtggccatcc 


ttcacaagtg 


cttcgattct 


ccatggacca 


cccgcgccct 


ttcagaaaca 


1080 


gtggctgagc 


agtcagtgag 


gcctctgaaa 


ctctccaagt 


ccaagatttt 


gctttcgtcc 


1140 


tcagggtgtt 


cagctgacat 


agattccggc 


aaacagtcat 


tgccctaccc 


acagccaggt 


1200 


ttggagtcag 


ctggcataga 


aagcccaact 


tccagtgtcc 


tggacaagaa 


ggagggtgaa 


1260 


caggccaaag 


ccatctttga 


aaaagtgaaa 


agattccgca 


tgcatgtgga 


gcagaaggac 


1320 


atcatttata 


gagtatatct 


gaaacagata 


atagtcaaag 


tcattttgtt 


tgtgctcatc 


1380 


ataacttatg 


ttccgtattt 


tttaacccac 


atcactcttg 


aaatcgactg 


ttcagttgat 


1440 


gtgcaggctt 


ttacaggata 


taagcgctac 


cagtgtgtct 


attccttggc 


agaaatcttt 


1500 


aaggtcctgg 


cttcatttta 


tgtcattttg 


gttatacttt 


atggtctgac 


ctcttcctac 


1560 


agcctgtggt 


ggatgctgag 


gagttccctg 


aagcaatatt 


cctttgaggc 


gttaagagaa 


1620 


aaaagcaact 


acagtgacat 


ccctgatgtc 


aagaatgact 


ttgccttcat 


ccttcatctg 


1680 


gctgatcagt 


atgatcctct 


ttattccaaa 


cgcttctcca 


tattcctatc 


agaggtcagt 


1740 


gagaacaaac 


tgaaacagat 


caacctcaat 


aatgaatgga 


cagttgagaa 


actgaaaagt 


1800 


aagcttgtga 


aaaatgccca 


ggacaagata 


gaactgcatc 


tttttatgct 


caacggtctt 


1860 


ccagacaatg 


tctttgagtt 


aactgaaatg 


gaagtgctaa 


gcctggagct 


tatcccagag 


1920 


gtgaagctgc 


cctctgcagt 


ctcacagctg 


gtcaacctca 


aggagcttcg 


tgtgtaccat 


1980 


tcatctctgg 


tcgtagacca 


tcctgcactg 


gcctttctag 


aggagaattt 


aaaaatcctc 


2040 


cgcctgaaat 


ttactgaaat 


gggaaaaatc 


ccacgctggg 


tatttcacct 


caagaatctc 


2100 


aaggaacttt 


atctttcggg 


ctgtgttctc 


cctgaacagt 


tgagtactat 


gcagttggag 


2160 


ggctttcagg 


acttaaaaaa 


tctaaggacc 


ctgtacttga 


agagcagcct 


ctcccggatc 


2220 


ccacaagttg 


ttacagacct 


cctgccttca 


ttgcagaaac 


tgtcccttga 


taatgaggga 


2280 


agcaaactgg 


ttgtgttgaa 


caacttgaaa 


aagatggtca 


atctgaaaag 


cctagaactg 


2340 


atcagctgtg 


acctggaacg 


catcccacat 


tccattttca 


gcctgaataa 


tttgcatgag 


2400 


ttagacctaa 


gggaaaataa 


ccttaaaact 


gtggaagaga 


tcattagctt 


tcagcatctt 


2460 


cagaatcttt 


cctgcttaaa 


gttgtggcac 


aataacattg 


cttatattcc 


tgcacagatt 


2520 


ggggcattat 


ctaacctaga 


gcagctctct 


ttggaccata 


ataatattga 


gaatctgccc 


2580 


ttgcagcttt 


tcctatgcac 


taaactacat 


tatttggatc 


taagctataa 


ccacttgacc 


2640 


ttcattccag 


aagaaatcca 


gtatctgagt 


aatttgcagt 


actttgctgt 


gaccaacaac 


2700 


aatattgaga 


tgctaccaga 


tgggctgttt 


cagtgcaaaa 


agctgcagtg 


tttacttttg 


2760 


gggaaaaata 


gcttgatgaa 


tttgtcccct 


catgtgggtg 


agctgtcaaa 


ccttactcat 


2820 


ctggagctca 


ttggtaatta 


cctggaaaca 


cttcctcctg 


aactagaagg 


atgtcagtcc 


2880 


ctaaaacgga 


actgtctgat 


tgttgaggag 


aacttgctca 


atactcttcc 


tctccctgta 


2940 
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acagaacgtt 


tacagacgtg 


cttagacaaa 


tgttgactta 


aagaaaagag 


acccgtgttt 


3000 


caaaatcatt 


tttaaaagta 


tgctcggccg 


ggcgtggtgg 


ctcatgccta 


taatcccagc 


3060 


actttgggag 


gccaagatgg 


gcggattgct 


tgaggtcagg 


agttcgagac 


cagtctggcc 


3120 


aacctggtga 


aaccccatct 


ctgctaaaac 


tacaaaaaaa 


ttagccaggc 


gtggtggcgt 


3180 


gcgcctgtaa 


tcccagctac 


ttgggaggct 


gacgcagggg 


aattgcttga 


accagggagg 


3240 


tggaggttgc 


agtgagccga 


gattgtgcca 


ctgtacacca 


gcctgggtga 


cagagcaaga 


3300 


ctcttatctc 


aaaaaaaaaa 


aaaaatgctc 


cagggcttta 


aatgagaagt 


aaaattttct 


3360 


aagttaataa 


agatgaagaa 


tgggtgacta 


ttatgatgaa 


ccataactaa 


atgtcttatt 


3420 


aaagcaactg 


agtgtctagc 


cctaaattaa 


ccaggtaaaa 


actgttaaca 


ctaacctgaa 


3480 


gttttgtgaa 


taactgttct 


ttaacttatt 


gagatgttgc 


aagaaatgca 


catccagggt 


3540 


ggactgggag 


ctatgaaatg 


actaaattcc 


tccttgcagt 


gtttacct 




3588 



<210> SEQ ID NO 19 

<211> LENGTH: 803 

<212> TYPE: PRT 

<213> ORGANISM: Homo sapiens 

<4 0 0> SEQUENCE: 19 

Met lie Thr Leu Thr Glu Leu Lys Cys Leu Ala Asp Ala Gin Ser Ser 
15 10 15 

Tyr His lie Leu Lys Pro Trp Trp Asp Val Phe Trp Tyr Tyr lie Thr 
20 25 30 

Leu lie Met Leu Leu Val Ala Val Leu Ala Gly Ala Leu Gin Leu Thr 
35 40 45 

Gin Ser Arg Val Leu Cys Cys Leu Pro Cys Lys Val Glu Phe Asp Asn 
50 55 ~ 6 0 

His Cys Ala Val Pro Trp Asp lie Leu Lys Ala Ser Met Asn Thr Ser 
65 70 75 80 

Ser Asn Pro Gly Thr Pro Leu Pro Leu Pro Leu Arg lie Gin Asn Asp 
85 90 95 

Leu His Arg Gin Gin Tyr Ser Tyr lie Asp Ala Val Cys Tyr Glu Lys 
100 105 110 

Gin Leu His Trp Phe Ala Lys Phe Phe Pro Tyr Leu Val Leu Leu His 
115 ^12 0 ^ 125 

Thr Leu lie Phe Ala Ala Cys Ser Asn Phe Trp Leu His Tyr Pro Ser 
130 135 140 

Thr Ser Ser Arg Leu Glu His Phe Val Ala lie Leu His Lys Cys Phe 
145 150 155 160 

Asp Ser Pro Trp Thr Thr Arg Ala Leu Ser Glu Thr Val Ala Glu Gin 
165 170 175 

Ser Val Arg Pro Leu Lys Leu Ser Lys Ser Lys lie Leu Leu Ser Ser 
180 ~ 185 190 

Ser Gly Cys Ser Ala Asp lie Asp Ser Gly Lys Gin Ser Leu Pro Tyr 
195 200 205 

Pro Gin Pro Gly Leu Glu Ser Ala Gly lie Glu Ser Pro Thr Ser Ser 
210 215 220 

Val Leu Asp Lys Lys Glu Gly Glu Gin Ala Lys Ala lie Phe Glu Lys 
225 ~ 230 235 240 

Val Lys Arg Phe Arg Met His Val Glu Gin Lys Asp lie lie Tyr Arg 
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245 250 255 

Val Tyr Leu Lys Gin lie lie Val Lys Val lie Leu Phe Val Leu lie 
260 265 270 

lie Thr Tyr Val Pro Tyr Phe Leu Thr His lie Thr Leu Glu lie Asp 
275 280 285 

Cys Ser Val Asp Val Gin Ala Phe Thr Gly Tyr Lys Arg Tyr Gin Cys 
290 295 300 

Val Tyr Ser Leu Ala Glu lie Phe Lys Val Leu Ala Ser Phe Tyr Val 
305 310 315 320 

lie Leu Val lie Leu Tyr Gly Leu Thr Ser Ser Tyr Ser Leu Trp Trp 
325 330 335 

Met Leu Arg Ser Ser Leu Lys Gin Tyr Ser Phe Glu Ala Leu Arg Glu 
340 345 350 

Lys Ser Asn Tyr Ser Asp lie Pro Asp Val Lys Asn Asp Phe Ala Phe 
355 360 365 

lie Leu His Leu Ala Asp Gin Tyr Asp Pro Leu Tyr Ser Lys Arg Phe 
37 0 3 75 3 80 

Ser lie Phe Leu Ser Glu Val Ser Glu Asn Lys Leu Lys Gin lie Asn 
385 390 395 400 

Leu Asn Asn Glu Trp Thr Val Glu Lys Leu Lys Ser Lys Leu Val Lys 

4 05 410 ~ 415 

Asn Ala Gin Asp Lys lie Glu Leu His Leu Phe Met Leu Asn Gly Leu 
420 425 430 

Pro Asp Asn Val Phe Glu Leu Thr Glu Met Glu Val Leu Ser Leu Glu 
435 440 445 

Leu lie Pro Glu Val Lys Leu Pro Ser Ala Val Ser Gin Leu Val Asn 
450 455 460 

Leu Lys Glu Leu Arg Val Tyr His Ser Ser Leu Val Val Asp His Pro 
465 470 475 480 

Ala Leu Ala Phe Leu Glu Glu Asn Leu Lys lie Leu Arg Leu Lys Phe 
485 490 495 

Thr Glu Met Gly Lys lie Pro Arg Trp Val Phe His Leu Lys Asn Leu 
500 505 510 

Lys Glu Leu Tyr Leu Ser Gly Cys Val Leu Pro Glu Gin Leu Ser Thr 
515 ~ 520 525 

Met Gin Leu Glu Gly Phe Gin Asp Leu Lys Asn Leu Arg Thr Leu Tyr 
530 535 540 

Leu Lys Ser Ser Leu Ser Arg lie Pro Gin Val Val Thr Asp Leu Leu 
545 550 555 560 

Pro Ser Leu Gin Lys Leu Ser Leu Asp Asn Glu Gly Ser Lys Leu Val 

5 65 570 575 

Val Leu Asn Asn Leu Lys Lys Met Val Asn Leu Lys Ser Leu Glu Leu 

580 ~ 585 " 590 

lie Ser Cys Asp Leu Glu Arg lie Pro His Ser lie Phe Ser Leu Asn 
595 600 605 

Asn Leu His Glu Leu Asp Leu Arg Glu Asn Asn Leu Lys Thr Val Glu 
610 615 620 

Glu lie lie Ser Phe Gin His Leu Gin Asn Leu Ser Cys Leu Lys Leu 
625 630 635 640 

Trp His Asn Asn lie Ala Tyr lie Pro Ala Gin lie Gly Ala Leu Ser 
645 650 " 655 
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Asn Leu Glu Gin Leu Ser Leu Asp His Asn Asn lie Glu Asn Leu Pro 
660 665 670 

Leu Gin Leu Phe Leu Cys Thr Lys Leu His Tyr Leu Asp Leu Ser Tyr 
675 680 685 

Asn His Leu Thr Phe lie Pro Glu Glu lie Gin Tyr Leu Ser Asn Leu 
690 695 700 

Gin Tyr Phe Ala Val Thr Asn Asn Asn lie Glu Met Leu Pro Asp Gly 
705 710 715 720 

Leu Phe Gin Cys Lys Lys Leu Gin Cys Leu Leu Leu Gly Lys Asn Ser 

725 ~ 730 ~ 735 

Leu Met Asn Leu Ser Pro His Val Gly Glu Leu Ser Asn Leu Thr His 
740 745 750 

Leu Glu Leu lie Gly Asn Tyr Leu Glu Thr Leu Pro Pro Glu Leu Glu 

755 760 765 

Gly Cys Gin Ser Leu Lys Arg Asn Cys Leu lie Val Glu Glu Asn Leu 
77 0 7 75 7 80 

Leu Asn Thr Leu Pro Leu Pro Val Thr Glu Arg Leu Gin Thr Cys Leu 
785 790 795 800 

Asp Lys Cys 



What is claimed is: 

1. A purified protein comprising the amino acid sequence 
of SEQ ID NO: 17. 

2. An isolated nucleic acid comprising the nucleotide 
sequence of SEQ ID NO: 16, or a coding region thereof, or 
the complement of any of the foregoing. 

3. The isolated nucleic acid of claim 2 which is DNA. 

4. An isolated nucleic acid comprising a nucleotide 
sequence encoding the protein of claim 1, or the complement 
thereof. 

5. A recombinant cell containing the nucleic acid of claim 
2, in which the nucleotide sequence is under the control of 
a promoter heterologous to the nucleotide sequence. 

6. A recombinant cell containing a nucleic acid vector that 
comprises the nucleic acid of claim 2. 

7. An antibody that binds to a protein consisting of the 
amino acid sequence of SEQ ID NO: 17. 

8. The antibody of claim 7 which is monoclonal. 

9. A molecule comprising a fragment of the antibody of 
claim 7, which fragment binds a protein consisting of the 
amino acid sequence of SEQ ID NO: 17. 

10. A method of producing a protein comprising: 

growing a recombinant cell containing the nucleic acid of 
any one of claims 2-4 in which said nucleotide 
sequence is under the control of a promoter heterolo- 
gous to said nucleotide sequence, such that the protein 
encoded by said nucleic acid is expressed by the cell; 
and 

recovering said expressed protein. 

11. An isolated protein that is the product of the process 
of claim 10. 

12. A pharmaceutical composition comprising a therapeu- 
tically effective amount of the protein of claim 1, and a 
pharmaceutically acceptable carrier. 



13. A pharmaceutical composition comprising a therapeu- 
tically effective amount of the nucleic acid of claim 2; and 
a pharmaceutically acceptable carrier. 

14. A pharmaceutical composition comprising a therapeu- 
tically effective amount of the recombinant cell of claim 5 or 
claim 6; and a pharmaceutically acceptable carrier. 

15. A pharmaceutical composition comprising a therapeu- 
tically effective amount of an antibody that binds to a protein 
comprising the amino acid sequence of claim 1, and a 
pharmaceutically acceptable carrier. 

16. A method of measuring the level of T cell activation 
in a subject, comprising: 

contacting a sample comprising mRNA or nucleic acid 
derived therefrom from a subject, with a nucleic acid 
probe that hybridizes to a nucleic acid that encodes the 
protein of claim 1 under conditions conducive to 
hybridization; and 

measuring the amount of said probe that hybridizes to 
nucleic acid in the sample; wherein the amount of 
hybridization is indicative of the level of T cell acti- 
vation. 

17. A method of measuring the level of T cell activation 
in a subject, comprising: 

contacting a sample derived from a patient with an 
antibody that binds the protein of claim 1, under 
conditions conducive to immunospecific binding; and 

measuring the amount of any immunospecific binding by 
the antibody wherein the amount of said immunospe- 
cific binding is indicative of the level of T cell activa- 
tion. 



